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Guest  Editorial 


The  Fifth  International  Workshop  on  Computational  Electronics  (IWCE-5)  was  held  on  the  campus  of 
the  University  of  Notre  Dame,  May  28-30,  1997.  As  in  previous  IWCE  meetings,  the  workshop  covered 
all  aspects  of  advanced  simulation  of  electronic  transport  in  semiconductors  and  semiconductor  devices, 
especially  those  which  use  large  computational  resources.  In  particular,  IWCE-5  focused  on  the  following 
three  major  themes:  (i)  Device  Simulation,  (ii)  Optoelectronics,  and  (iii)  Quantum  Simulation.  The 
scientific  program,  organized  in  a  single-session  format,  consisted  of  invited  lectures,  contributed  talks, 
and  poster  presentations.  IWCE-5  attracted  some  170  registered  participants,  and  featured  15  invited 
speakers,  40  oral  presentations,  and  75  poster  papers.  As  a  novelty,  IWCE-5  included  an  evening  session 
with  hands-on  software  demonstrations. 

The  present  workshop  is  the  fifth  in  a  series  of  international  meetings  which  evolved  from  earlier 
workshops  held  under  the  auspices  of  the  NSF-sponsored  National  Center  for  Computational  Electronics 
(NCCE)  at  the  Beckman  Institute  of  the  University  of  Illinois  at  Urbana-Champaign.  The  first 
International  Workshop  on  Computational  Electronics  was  held  there  on  May  28-29,  1992.  IWCE-2  was 
held  at  the  University  of  Leeds  (UK)  on  August  1 1-13,  1993.  The  third  workshop  took  place  in  Portland, 
Oregon,  on  May  18-20,  1994,  and  IWCE-4  was  held  in  Tempe,  Arizona,  on  October  30-November  1, 
1995. 

We  are  grateful  for  financial  sponsorship  of  IWCE-5  by  the  National  Science  Foundation,  the  Office  of 
Naval  Research,  the  Army  Research  Office,  and  the  National  Center  for  Computational  Electronics,  as 
well  as  local  support  by  the  Notre  Dame  Department  of  Electrical  Engineering  and  the  College  of 
Engineering.  Technical  co-sponsorship  was  provided  by  the  IEEE  Electron  Devices  Society. 

We  gratefully  acknowledge  the  conference  committees  for  their  help  in  the  workshop  organization.  The 
International  Advisory  Committee  provided  guidance  and  gave  shape  to  IWCE-5  through  the  nomination 
of  session  topics  and  invited  speakers.  The  Program  Committee  selected  contributed  and  poster  papers 
from  among  the  submitted  abstracts. 

Finally,  I  would  like  to  thank  all  the  participants  for  their  contributions  and  for  helping  to  make 
IWCE-5  a  success! 


Wolfgang  Porod 
University  of  Notre  Dame 
May  1997 
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Two-dimensional  Carrier  Transport  in  Si  MOSFETs 

SHIN-ICHI  TAKAGI 

Advanced  Semiconductor  Devices  Research  Laboratories,  Toshiba  Corporation, 

8,  Shinsugita-cho,  Isogo-ku,  Yokohama,  Japan  235 


The  importance  of  2-dimensional  (2D)  features  of  carriers  in  Si  MOSFETs  on  the  device 
performance  is  re-examined  experimentally  and  theoretically  from  the  viewpoint  of  low- 
field  mobility,  velocity  in  high  tangential  fields  and  the  inversion-layer  capacitance.  It  is 
confirmed  that  low-field  mobility  and  inversion-layer  capacitance  can  be  understood 
well  in  terms  of  the  2D  subbands  and  the  2D  carrier  transport.  In  order  to  obtain  fully- 
quantitative  understanding  of  low-field  mobility,  however,  it  is  still  necessary  to  more 
accurately  determine  the  amount  of  the  scattering  parameters  in  the  inversion  layer.  On 
the  other  hand,  saturation  velocity  is  considered  to  be  less  influenced  by  the  2D 
quantization,  while  it  is  found  experimentally  that  saturation  velocity  is  slightly 
dependent  on  surface  carrier  concentration. 

According  to  the  knowledge  of  2-dimensional  carrier  transport  in  Si  inversion  layer, 
an  effective  way  to  have  higher  current  drive  is  to  increase  the  occupancy  of  the  2-fold 
valleys,  which  have  lower  conductivity  mass,  on  a  (100)  surface.  From  this  viewpoint, 
two  device  structures,  strained  Si  MOSFETs  and  SOI  MOSFETs  with  ultra- thin  SOI 
films,  are  introduced  and  the  behavior  of  low-field  mobility  is  analyzed  through  the 
calculations  of  the  subband  structures  and  phonon-limited  mobility. 


Keywords:  MOSFET,  Si,  carrier  transport,  mobility,  carrier  velocity,  inversion  layer,  2-dimen¬ 
sional  carrier  gas,  subband  structure,  strain,  SOI 


1.  INTRODUCTION 

It  is  well  recognized  that  2-dimensional  (2D) 
features  of  carriers  significantly  affect  the  carrier 
transport  in  the  inversion  layer  of  Si,  which 
determines  the  drain  current  of  MOSFETs.  The 
influence  of  the  2D  quantization  on  the  I-V 
characteristics  of  MOSFETs  appears  typically  in 
the  following  ways.  (1)  Mobility  is  modulated 
through  the  change  of  the  scattering  probability 
due  to  the  differences  in  the  density-of-states,  the 


form  factor  of  the  envelope  function  of  2D  carriers 
and  the  scattering  parameters.  (2)  Gate  capacitance 
is  decreased  by  the  inversion-layer  capacitance, 
which  is  determined  by  the  finite  thickness  of  the 
envelope  function.  (3)  The  threshold  voltage  of 
MOSFETs  is  increased  by  the  existence  of  the 
subband  energy.  The  influences  of  (1)  and  (3)  are 
pronounced  more  with  an  increase  in  the  substrate 
impurity  concentration  of  MOSFETs  and  the 
influence  of  (2)  is  pronounced  more  with  an  decrease 
in  the  gate  oxide  thickness.  Thus,  the  accurate 
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consideration  of  the  2D  quantization  is  quite 
important  to  quantitatively  understand  the  electri¬ 
cal  properties  of  scaled  MOSFETs. 

Many  theoretical  and  experimental  works  [1] 
have  so  far  been  done  on  the  characterization  of  the 
2D  carrier  transport  and,  as  a  result,  the  under¬ 
standing  of  the  2D  quantization  effects  on  the 
MOSFETs  performance  has  been  basically  ob¬ 
tained,  particularly  at  low  temperatures.  It  seems, 
however,  that  the  direct  experimental  evidence  to 
verify  the  contribution  of  the  2D  quantization  at 
room  temperature  is  still  lacking  and  the  quantita¬ 
tive  description  has  not  been  fully  obtained  yet.  In 
this  paper  we  re-examine  the  influence  of  the  2D 
quantization  on  the  basic  physical  quantities  to 
determine  I-V  characteristics  of  MOSFETs,  the 
mobility  and  the  saturation  velocity,  vsat,  in  the 
inversion  layer,  and  the  inversion-layer  capacitance, 
^inv>  through  the  systematic  experiments.  In  addi¬ 
tion,  the  analysis  based  on  the  self-consistent 
subband  calculations  is  performed  to  explain  the 
experimental  results.  The  present  understanding 
and  the  questions  still  unsolved  are  presented  with 
comparing  the  self-consistent  subband  calculations. 

Moreover,  based  on  the  knowledge  of  the  2D 
carrier  transport  in  the  inversion  layer  of  Si,  a 
subband  engineering  scenario  to  obtain  the  higher 
performance  of  Si  MOSFETs  is  introduced. 


2.  SUBBAND  STRUCTURE  OF  Si  MOS 
INTERFACE 

Figure  1  shows  the  schematic  diagram  of  the 
subband  structure  for  inversion-layer  electrons  on 
a  (100)  surface,  on  which  MOSFETs  are  com¬ 
monly  fabricated.  Electrons  confined  near  Si/Si02 
interface  by  normal  electric  field  are  quantized  in 
the  direction  perpendicular  to  the  interface  and 
grouped  into  electronic  subbands  as  2D  Electron 
Gas  (2DEG).  A  characteristic  point  of  the  sub¬ 
band  structure  on  (100)  is  that  electrons  in  the 
inversion  layer  are  classified  into  two  electronic 
systems,  the  2-fold  degenerate  valleys  and  the  4- 
fold  degenerate  valleys,  as  shown  in  Figure  1. 


*  lower  conductivity  mass  (0.1 9m0)  »  higher  conductivity  mass  (0.31 5m0) 

higher  mobility  -*  lower  mobility 

•  higher  m2  (0.91 6mc)  *  lower  mz  (0.1 9m0) 

->  thinner  inversion  layer  ->  thicker  inversion  layer 

lower  subband  energy  -*  higher  subband  energy 

FIGURE  1  Schematic  diagram  of  the  subband  structures  for 
electrons  on  a  (100)  surface  and  the  characteristics  of  the  2-fold 
and  the  4-fold  valleys. 

Electrons  in  the  2-fold  valleys  have  higher  mobility 
due  to  smaller  conductivity  mass  and  thinner 
inversion-layer  thickness  due  to  higher  effective 
mass  perpendicular  to  the  Si/Si02  interface  than 
those  in  the  4-fold  valleys.  As  a  result,  the 
electrical  properties  of  inversion-layer  electrons 
on  (100)  need  to  be  understood  as  the  weighed 
combination  over  these  two  electronic  systems. 

3.  IMPACT  OF  2D  CARRIER  GAS  ON 
CARRIER  TRANSPORT  PROPERTIES 
IN  MOSFETs 

3.1.  Low  Field  Mobility 

Low  field  mobility  in  Si  MOSFETs  has  so  far  been 
studied  most  extensively  as  the  2-dimensional 
carrier  transport  from  both  the  theoretical  and 
the  experimental  viewpoints.  Recently,  the  inver¬ 
sion-layer  mobility  has  come  to  be  explained  in 
terms  of  the  scattering  theory  for  the  2D  carrier 
gas  [2-6],  even  at  room  temperature.  Figure  2 
shows  the  experimental  relationship  between 
electron  mobility  on  (100)  and  the  normal  effective 
field,  as  a  parameter  of  the  substrate  impurity 
concentration,  N'a-  Eeff  is  defined  by  q(NdpX  +  7]Ns)/ 
eSi),  where  Adpl  is  the  surface  concentration  of  the 
space  charge  in  the  depletion  region,  Ns  is  the 
surface  carrier  concentration  and  eSi  is  the 
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permittivity  of  Si.  One  evidence  of  the  2D  quantiza¬ 
tion  effects  on  the  inversion-layer  mobility  appears 
as  its  dependence,  which  is  the  origin  of  the 
“universal  curve”.  The  universality  of  the  inversion- 
layer  mobility  for  EQ^,  typically  seen  in  Figure  2,  has 
been  verified  experimentally  over  a  wide  range  of  the 
substrate  impurity  concentration  for  n-  and  p- 
MOSFETs  [7-11]  fabricated  on  several  surface 
orientations  [12].  The  EQ ff  dependence  in  low  and 
moderate  £e{f  region,  which  is  roughly  proportional 
to  E-J\  is  thought  to  be  attributable  to  phonon 
scattering  for  2DEG,  while  the  stronger  iseff 
dependence  in  higher  iseff  region  is  influenced  by 
surface  roughness  scattering.  The  mobility  limited 
by  intravalley  acoustic  phonon  scattering  for  2DEG 
is  represented  under  the  approximation  of  the  single 
subband  occupation  [12,  13]  by 

„  =  9  tA _ W 

ac  mc  man,,. DACk%T 

^  q_  tfptf  / 256eSjfi2  \ 1/3  ^ 

~  mc  manvDacksT  \  81^3 q2  )  ' 

{  i  i  \  — 1/3 

•(Mipi+^A'sJ  oc 


0.1  1.0 


EFFECTIVE  FIELD  [  MV/cm  ] 


FIGURE  2  Relationship  between  inversion-layer  electron 
mobility  on  (100)  and  the  effective  normal  field,  2seff,  as  a 
parameter  of  the  substrate  impurity  concentration.  iseff  is 
defined  by  q(Ndp]  +  As/2)/eSi  (7?=  1/2),  where  iVdpi  is  the 
surface  concentration  of  the  space  charges  and  Ns  is  the  surface 
carrier  concentration. 


mc  is  conductivity  mass,  md  is  density-of-states 
mass,  nv  is  the  valley  degeneracy,  Dac  is  the 
deformation  potential  of  acoustic  phonon,  s\  is 
sound  velocity.  This  EQ^  dependence  originates  in 
the  fact  that  the  mobility  is  in  proportion  to  the 
inversion-layer  thickness,  which  is  proportional  to 
E~e }1/3  under  77  of  11/32.  Since  the  inverse  of  the 
inversion-layer  thickness  determines  the  energy 
band  width  of  acoustic  phonon  that  can  couple 
with  2DEG,  the  thinner  inversion  layer  leads  to 
the  higher  scattering  rate  with  acoustic  phonon. 

In  order  to  more  quantitatively  describe  mobility 
at  room  temperature,  however,  it  is  necessary  to 
take  the  contributions  of  all  the  subbands  and  the 
interactions  with  intra-  and  intervalley  phonons 
into  consideration.  At  present,  the  amount  of  the 
phonon-limited  mobility  and  its  iseff  dependence 
seems  not  to  have  been  perfectly  represented  by  the 
theoretical  calculations  [3,  4,  6].  Figures  3  and  4 
show  the  Ecff  dependence  and  the  temperature 
dependence  of  the  experimental  and  the  calculated 
phonon-limited  electron  mobility,  respectively.  The 
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FIGURE  3  Calculated  phonon-limited  mobility  in  the  inver¬ 
sion  layer  at  room  temperature  as  a  function  of  £efr-  Three 
parameters  sets  of  intervalley  scattering,  set  A,  B  and  C,  which 
are  listed  in  Table  I,  are  compared  with  the  experimental  results, 
which  are  shown  as  closed  circles. 
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TEMPERATURE [K] 


FIGURE  4  Calculated  phonon-limited  mobility  in  the  inver¬ 
sion  layer  at  Ns  of  lxl(r2  cm-2  as  a  function  of  temperature. 
Three  parameters  sets  of  intervalley  scattering,  set  A,  B  and  C, 
which  are  listed  in  Table  I,  are  compared  with  the  experimental 
results,  which  are  shown  as  closed  circles. 


mobility  calculation  was  carried  out  under  the 
relaxation  time  approximation,  based  on  the  results 
of  the  self-consistent  subband  calculations  [14] 
including  20  subbands.  Three  different  parameter 
sets  of  intervalley  phonon  scattering,  which  are 
listed  in  Table  I,  are  used  in  the  calculations.  It  is 
found  that  the  amount  of  the  mobility  calculated 
under  the  bulk  phonon  scattering  parameters 
(parameter  set  A  [15]  and  B  [16])  is  higher  than  that 
of  the  experimental  one  and  the  calculated  tem¬ 


perature  dependence  is  weaker  than  the  experimen¬ 
tal  one. 

One  possible  origin  of  this  discrepancy  might  be 
the  difference  in  the  scattering  parameters  in  the 
inversion  layer  from  those  in  bulk,  which  might  be 
partly  due  to  the  difference  of  phonons  relevant  to 
intervalley  scattering  between  3DEG  and  2DEG, 
coming  from  the  selection  rule  in  the  wave-vector 
conservation.  As  one  example  of  the  different 
scattering  parameters,  a  parameter  set  including 
larger  deformation  potential  of  intervalley  f- 
phonon  scattering  (parameter  set  C)  was  used 
for  the  mobility  calculation.  The  results  are  also 
shown  in  Figures  3  and  4.  It  is  found  that  the 
higher  coupling  with  intervalley  phonons  than  in 
bulk  can  provide  better  agreement  with  the 
experimental  results  for  both  the  magnitude  of 
mobility  and  its  temperature  dependence.  Further¬ 
more,  as  described  later,  the  analysis  of  the 
mobility  enhancement  in  strained  Si  MOSFETs 
also  suggests  the  higher  coupling  between  2DEG 
and  intervalley  phonons.  The  mobility  enhance¬ 
ment  factor,  the  ratio  of  the  mobility  in  strained  Si 
MOSFETs  to  that  in  conventional  Si  MOSFETs, 
is  also  explained  well  by  assuming  the  higher 
coupling  with  intervalley  phonons  (see  Fig.  8).  In 
order  to  more  quantitatively  obtain  the  values  of 
the  scattering  parameters  in  the  inversion  layer, 
other  experimental  evidences  through  the  direct 
evaluation  of  the  scattering  parameters  are  re¬ 
quired.  On  the  other  hand,  even  if  the  parameter 
set  C  is  used,  the  calculated  £eff  dependence  is  still 
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TABLE  I  Parameters  for  intervalley  phonon  scattering  models.  Ek  and  Dk  are  the  values  of  phonon  energy  and  deformation 
potential,  respectively.  The  parameters  of  models  A  and  B  were  taken  from  [15]  and  [16],  respectively 


parameter  model 

type  of  intervalley  scattering 

Ek[meV] 

Dk[xl08[eV/cm] 

19.0 

0.3 

model  A 

f 

47.5 

2.0 

59.1 

2.0 

(Jacoboni) 

12.1 

0.5 

g 

18.6 

0.8 

62.2 

11.0 

model  B 

f 

59.0 

8.0 

(Y amada  and  Ferry) 

g 

63.0 

8.0 

model  C 

f 

59.0 

11.0 

(this  work) 

g 

63.0 

8.0 
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weaker  than  the  experimental  one.  Further  refine¬ 
ments  on  the  phonon  scattering  model  including 
the  effect  of  the  anisotropic  deformation  potential 
[4,  17,  18]  are  also  expected  to  provide  better 
agreement. 

Another  unclear  point  regarding  the  universal 
mobility  curve  is  the  origin  of  77  in  iseff.  It  has 
already  been  reported  that  the  value  of  77  is  1/2  for 
electrons  on  (100)  [7-9,  11]  and  1/3  for  holes  on 
(100)  [10,  11]  at  room  temperature.  Furthermore, 
it  has  also  been  found  [12]  that  the  value  of  77  is  1/3 
for  electrons  on  (110)  and  (111),  suggesting  that 
the  subband  structure  can  affect  the  value  of  77 
significantly.  Although,  according  to  (1),  77  is 
roughly  1/3,  the  only  one  subband  is  assumed  for 
this  equation.  Thus,  the  detailed  analysis  of  77 
including  all  the  subbands  and  the  contributions  of 
intra-  and  inter- valley  scattering  [19]  are  required 
to  clarify  the  origin. 

3.2.  Saturation  Velocity 

Saturation  velocity,  vsat,  in  the  inversion  layer  is  a 
quite  important  physical  quantity  in  determining 
the  drain  current  in  ultra-short  channel  MOS¬ 
FETs.  It  seems,  however,  that  the  magnitude  of 
vsat  in  the  inversion  layer  and  the  dependences  on 
the  physical  parameters  such  as  2?eff>  Ns  and  Thave 
not  been  fully  established  yet,  partly  because  the 
large  variation  is  seen  in  the  measured  values 
among  the  references  [20-22].  While  several  data 
have  suggested  that  vsat  in  the  inversion  layer  is 
lower  than  that  in  bulk,  it  has  not  been  clarified 
whether  the  2D  quantization  at  the  Si/Si02  inter¬ 
face  has  any  influence  on  vsat  in  the  inversion  layer. 
Thus,  the  relationship  between  the  carrier  velocity 
in  the  inversion  layer  and  the  tangential  electric 
field  has  been  evaluated,  using  MOSFETs  with  the 
resistive  gate  [20],  in  order  to  study  the  effect  of  the 
2D  quantization  on  vsat.  Figure  5  shows  the 
experimental  v-E  curves  as  a  parameter  of  Ns. 
The  schematic  cross  section  of  the  device  structure 
is  shown  as  the  inset  of  Figure  5.  It  is  observed  that 
vsat  is  dependent  on  Ns  and  decreases  from  the 
value  in  bulk  Si,  lxlO7  cm/s,  with  increasing  jVs, 


TANGENTIAL  FIELD  [  kV/cm  ] 


FIGURE  5  Experimental  results  of  the  electron  velocity - 
electric  field  curves  in  the  inversion  layer  at  room  temperature 
as  a  function  of  Ns.  The  direction  of  the  electric  field  is  parallel 
to  (1 10)  axis.  The  inset  figure  shows  the  schematic  cross  section 
of  resistive  gate  MOSFETs  to  evaluate  carrier  velocity  in  the 
inversion  layer.  In  order  to  realize  the  uniform  Ns  along  the 
channel,  the  voltage  of  Vg\  +  Vd  is  applied  to  the  terminal  Vg2. 

though  the  saturation  of  v  is  not  obtained  at  Ns  of 
lxlO12  cm-2. 

One  of  the  most  important  points  in  experimen¬ 
tally  evaluating  vsat  in  the  inversion  layer  is  to 
realize  the  uniform  distribution  of  Ns  and  the 
electric  field  along  the  channel  of  MOSFETs.  Note 
that  the  velocity  determined  from  the  drain  current 
in  conventional  MOSFETs  is  inaccurate  because 
of  the  non-uniformity  of  Ns  along  the  channel, 
unless  MOSFETs  with  sufficiently  thick  gate 
oxides  or  SOI  MOSFETs  with  sufficiently  thick 
buried  oxides  are  used  [22].  The  uniformity  of  Ns 
along  the  channel  in  the  high-resistive  gate 
MOSFETs  shown  in  Figure  5  has  been  examined 
[23],  using  device  simulations.  As  a  consequence,  it 
was  found  that,  although  a  distribution  of  Ns 
along  the  channel  is  observed,  the  Ns  dependence 
of  vsat  in  Figure  5  cannot  be  explained  only  by  this 
non-uniformity  of  NSi  but  this  Ns  dependence  is 
provided  by  some  feature  inherent  to  carriers  in 
the  channel  of  MOSFETs. 

In  order  to  examine  whether  the  lowering  of  vsat 
with  an  increase  in  Ns  is  attributable  or  not  to  the 
2-dimensional  properties  of  carriers,  the  substrate 
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bias  dependence  of  the  velocity  was  measured.  It 
should  be  noted  that  the  Ns  dependence  of  vsat  in 
Figure  5  can  include  both  effects  of  the  normal 
electric  field  and  the  carrier  concentration,  because 
Fg  was  simply  changed  to  control  Ns.  The 
application  of  both  Fg  and  Fsub  allows  to  control 
Ns  and  separately.  If  the  2D  quantization  plays 
an  important  role  in  the  lowering  of  vsat,  Ee ff  is 
expected  to  determine  vsat  in  the  inversion  layer. 
Figure  6  shows  the  v-E  curves  in  the  inversion 
layer  with  and  without  the  substrate  bias.  The  gate 
voltage  in  the  curve  B  was  adjusted  so  as  to  make 
£eff  in  the  curve  B  identical  to  that  in  the  curve  C, 
where  the  substrate  bias  of  —6  V  was  applied.  On 
the  other  hand,  the  curve  A  and  the  curve  C  have 
the  same  Ns.  It  is  found  in  Figure  6  that  the 
velocity  in  the  curve  A  is  almost  the  same  as  in  the 
curve  C  and,  thus,  the  velocity  is  determined  not 
by  2ieff  but  by  Ns  itself.  This  fact  means  that  the 
lowering  of  vsat  with  an  increase  in  Ns  might  be 
attributable  not  to  the  2-dimensional  properties  of 
carriers,  but  to  an  effect  inherent  to  Ns  or  carrier 
concentration  itself  like  carrier -carrier  scattering. 

One  possible  mechanism  responsible  for  the  Ns 
dependence  of  vsat  might  be  plasmon  scattering 
[24].  If  plasmon  decays  into  electrons  through  the 
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FIGURE  6  Electron  velocity -electric  field  curves  with 
(Curve  C)  and  without  (Curve  A  and  B)  the  substrate  bias, 
Ksub.  While  the  value  of  Ns  is  in  Curve  C  is  the  same  as  in  Curve 
A ,  the  values  of  in  Curve  C  is  the  same  as  in  Curve  B. 


emission  of  phonons,  this  process  could  become 
the  energy  dissipation  mechanism  of  hot  carriers 
under  the  high  electric  field.  Note  that  the  plasmon 
energy  with  the  carrier  concentration  of  5xl018 
cm-3,  which  is  typically  seen  in  the  inversion  layer, 
is  40  meV,  which  is  almost  the  same  as  the  energies 
of  intervalley  phonons  seen  in  Table  I.  As  the 
carrier  concentration  increases,  the  plasmon  en¬ 
ergy  increases  and  the  possibility  of  the  energy  loss 
due  to  the  phonon  emission  might  also  increase. 
Further  experimental  evidence  and  theoretical 
verification  on  the  relevance  of  plasmon  with  the 
energy  loss  of  hot  carriers  in  the  inversion  layer  are 
strongly  required. 

4.  INVERSION-LAYER  CAPACITANCE 

One  of  the  most  crucial  limitations  in  the 
miniaturization  of  MOSFETs  is  that  the  gate 
capacitance  cannot  be  sufficiently  increased  by 
thinning  the  gate  oxide,  because  of  the  existence  of 
the  inversion-layer  capacitance,  Cinv  [25-27]. 
Since  the  total  gate  capacitance,  Ciot,  is  described 
by  Cox/(l  +  Cox/Cinv),  the  influence  of  Cjnv  be¬ 
comes  more  serious  with  reducing  the  oxide 
thickness.  Thus,  it  is  quite  important  to  clarify 
the  origin  of  Qnv  and  to  quantitatively  evaluate 
Cinv.  Figure  7  shows  the  schematic  diagram  of  Cinv 
as  a  function  of  Ns  under  the  simple  analytical 
models  [27],  suggesting  that  the  effect  of  the  2D 
quantization  on  Cinv  is  dominant  in  higher  Ns 
region.  The  reason  why  the  2D  quantization  is 
important  on  Cjnv  is  that  the  finite  inversion-layer 
thickness,  which  effectively  works  as  the  series 
capacitance  to  the  oxide  capacitance,  is  much 
thicker  in  2D  carrier  gas  than  in  bulk  carriers.  In 
the  model  of  the  single  subband  occupation,  Qnv 
due  to  the  finite  inversion-layer  thickness  under 
the  2D  quantization  can  be  simply  described  by 
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FIGURE  7  Schematic  diagram  of  the  physical  origins  of  Cinv 
and  the  N&  dependence  of  Cinv  under  the  simple  analytical 
models  [27]. 


Here,  Zinv  is  the  average  thickness  of  2DEG,  h  is 
the  Plank  constant  and  m3  is  the  effective  mass  of 
Si  perpendicular  to  the  Si/Si02  interface.  Accord¬ 
ing  to  this  formulation,  Cinv  should  have  the 
dependence  on  m3.  Figure  8  shows  the  experi¬ 
mental  Cinv  at  room  temperature  as  a  function  of 


FIGURE  8  Experimental  Cinv  -  As  curves  for  electrons  in  the 
inversion  layer  on  (100),  (110)  and  (111)  surfaces  at  298  K.  The 
value  of  effective  mass  perpendicular  to  the  interface,  m 3,  is 
described  for  each  surface  orientation. 


Ns  for  (100),  (110)  and  (111)  surfaces,  which  have 
the  different  values  of  m3.  It  is  found  that  the 
measured  values  of  Cmv  become  smaller  in  the 
order  of  (100),  (110)  and  (111).  Actually,  it  is 
confirmed  that  the  surface  orientation  dependence 

1  /3 

of  Cinv  is  explained  quantitatively  by  the  m3/ 
dependence.  This  result  is  the  direct  experimental 
evidence  for  the  fact  that  Cinv  at  room  temperature 
is  determined  by  the  quantum  mechanical  inver¬ 
sion-layer  thickness,  because  the  surface  orienta¬ 
tion  does  not  make  any  difference  for  3D  carrier 
gas.  This  result  also  means  that  a  larger  value  of 
m3  leads  to  the  suppression  of  the  degradation  of 
the  gate  capacitance  due  to  Cinv. 

Figure  9  shows  the  experimental  and  the 
calculated  Cinv  for  inversion-layer  electrons  on 
(100)  at  room  temperature.  The  calculated  Cjnv 
was  determined  directly  from  d^s/dFg  through  the 
self-consistent  subband  calculations  without  using 
any  definition  of  the  inversion-layer  thickness  [27]. 
It  is  confirmed  that  Cinv  is  accurately  represented 
by  the  calculation,  meaning  that  Cjnv  is  quantita¬ 
tively  understood  even  at  room  temperature  in 
terms  of  the  2D  subband. 


Ns  [  cm'2  ] 


FIGURE  9  Comparison  in  Cinv-Ns  curves  on  (100)  surface  at 
298  K  between  the  self-consistent  calculation  and  the  experi¬ 
mental  results.  The  open  circles  and  the  closes  circles  represent 
the  results  before  and  after  the  influence  of  the  depletion  layer 
in  the  poly-Si  gate  was  subtracted  from  the  experimental  data. 
In  the  self-consistent  subband  calculation,  20  subbands  (10 
subbands  for  each  subband  ladder)  were  taken  into  account. 
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5.  SUBBAND  ENGINEERING  FOR  HIGHER 
PERFORMANCE  MOSFETs 

In  order  to  obtain  the  higher  current  drive  in 
MOSFETs,  higher  mobility  and  higher  inversion- 
layer  capacitance  are  required.  In  terms  of  the 
effective  mass,  these  requirements  mean  lighter 
effective  mass  parallel  to  Si/Si02  interface,  which 
increases  mobility,  and  heavier  effective  mass 
perpendicular  to  the  interface,  which  maximizes 
Gnv  ■  From  this  viewpoint,  the  2-fold  valleys  on  a 
(100)  surface  are  an  optimum  electronic  system  in 
the  inversion  layer  of  Si,  as  summarized  in 
Figure  1.  In  conventional  MOSFETs,  however, 
the  energy  difference  in  the  lowest  subband  energy 
of  the  2-fold  valleys,  E0,  and  that  of  the  4-fold 
valleys,  E'0,  is  small.  As  a  result,  the  occupancy  of 
the  2-fold  valleys  in  conventional  MOSFETs  is  not 
sufficiently  large  at  room  temperature.  Therefore, 
an  effective  strategy  to  obtain  the  higher  current 
drive  is  to  increase  the  energy  difference  in  the 
lowest  subband,  E'0  —  E$.  There  are  two  possible 
ways  to  realize  such  a  modulation  in  the  subband 
structures.  One  way  is  to  apply  the  tensile  strain  in 
Si  substrate.  It  is  known  that  tensile  biaxial  strain 
parallel  to  the  interface  causes  the  band  splitting 
between  the  2-fold  valleys  and  the  4-fold  valleys. 
This  structure  corresponds  to  strained  Si  MOS¬ 
FETs.  The  other  way  is  to  utilize  the  size  effect  due 
to  built-in  confinement  potential  such  as  the  band 
discontinuity  at  hetero-interfaces.  For  example, 
when  SOI  films  in  SOI  MOSFETs  is  thinner  than 
the  inversion  layer  in  bulk  MOSFETs,  the  sub¬ 
band  structures  can  be  significantly  modified  and 
the  resultant  occupancy  of  the  2-fold  valleys  can 
increase  by  the  size  effect  of  the  SOI  film  itself.  The 
effectiveness  of  this  subband  engineering  is  exam¬ 
ined  through  the  subband  and  mobility  calcula¬ 
tions. 

5.1.  Strained  Si  MOSFETs 

It  is  known  that  tensile  strain  in  Si,  which  is 
typically  seen  in  Si  grown  on  relaxed  SiGe,  causes 
the  band  splitting  between  the  2-  and  the  4-fold 


valleys,  which  leads  to  the  increase  in  E0  -  E'0  and 
the  resulting  higher  occupation  in  the  2-fold 
valleys.  As  shown  in  Figure  10,  it  has  been 
confirmed  experimentally  [28],  [29]  that  the  inver¬ 
sion-layer  mobility  in  strained  Si  MOSFETs  at 
room  temperature  increases  up  to  around  twice  as 
high  as  in  conventional  MOSFETs  with  an 
increase  in  tensile  strain,  which  is  controlled  by 
the  Ge  content  of  SiGe  substrates.  The  calculated 
phonon-limited  mobility  for  2DEG  in  strained  Si 
is  also  shown  in  Figure  10.  Good  agreement  with 
the  experimental  mobility  ratio  is  obtained  by 
using  the  higher  coupling  constants  with  inter¬ 
valley  phonons  (parameter  set  C).  This  is  another 
evidence  for  the  higher  coupling  of  intervalley 
phonons  with  2DEG  in  the  inversion  layer  than 
with  bulk  electrons,  as  described  in  3.1.  The 
calculated  results  have  also  revealed  that,  in 
addition  to  the  preferential  occupation  of  the  2- 
fold  valleys,  the  suppression  of  intervalley  scatter- 


Substrate  Ge  Content  [  %  ] 


FIGURE  10  Mobility  enhancement  factor  (the  ratio  of  the 
mobility  in  strained  Si  MOSFETs  to  that  in  unstrained 
(conventional)  Si  MOSFETs)  versus  the  Ge  content  of  a  SiGe 
substrate  on  which  the  strained  layer  is  formed.  The  amount  of 
strain  in  strained  Si  is  in  proportion  to  the  Ge  content.  The 
experimental  results  were  taken  from  [28].  The  dotted,  dashed 
and  solid  lines  represent  the  calculated  results  using  the 
intervalley  phonon  scattering  parameter  sets  A,  B  and  C. 
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ing  due  to  the  band  splitting  leads  to  the  mobility 
enhancement  in  strained  Si  MOSFETs  [6,  30].  It 
has  also  been  found  that  Cinv  in  the  inversion  layer 
of  Si  slightly  increases  with  tensile  strain. 


5.2.  Ultra-Thin  Film  SOI  MOSFETs 

In  order  to  examine  the  possibility  of  the  modula¬ 
tion  of  the  subband  energy  and  the  resultant 
occupancy  of  the  2-fold  valleys  in  SOI  MOSFETs 
with  SOI  films  thinner  than  the  inversion  layer  in 
bulk  MOSFETs,  the  calculations  of  the  subband 
structures  and  the  phonon-limited  mobility  were 
performed  [31].  Figure  11  shows  the  calculated 
phonon-limited  mobility  as  a  function  of  the  SOI 
thickness,  Tsol.  With  decreasing  Tsol  the  mobility 
decreases  slightly  at  first,  and  increases  with 
decreasing  Tsoi  from  5  nm  to  3  nm.  With 
decreasing  TSoi  from  3  nm,  the  mobility  decreases 
again.  It  is  found,  moreover,  that  the  mobility  with 
the  SOI  thickness  of  around  3  nm  can  be  higher 
than  that  in  conventional  MOSFETs.  This  en¬ 
hancement  is  attributable  to  the  fact  that,  with 


decreasing  the  SOI  thickness,  the  electron  occupa¬ 
tion  of  the  2-fold  valleys  becomes  higher,  because 
of  the  increase  in  E'0.  The  calculated  occupancy  of 
the  2-fold  and  the  4-fold  valleys  is  shown  as  a 
function  of  TSoi  in  Figure  12.  It  is  confirmed  that 
the  occupancy  of  the  2-fold  valleys  increases  with  a 
decrease  in  Tsoi  and  becomes  almost  100% 
around  3  nm  of  Tsoi*  The  reason  why  E'0  increases 
more  rapidly  than  E0  with  a  decrease  in  Tsoi  is 
that  the  inversion  layer  thickness  of  the  4-fold 
valleys  is  thicker  than  that  of  the  2-fold  valleys 
and,  thus,  E'0  is  more  sensitive  to  the  SOI  physical 
thickness  than  E0. 

It  has  been  also  confirmed  that  Cinv  in  SOI 
MOSFETs  with  TSoi  thinner  than  5  nm  is 
significantly  improved,  because  the  inversion-layer 
thickness  is  determined  by  the  SOI  thickness  itself 
[32].  These  results  demonstrate  that,  if  the  SOI 
MOSFETs  that  have  the  perfectly  flat  interfaces 
with  the  gate  oxide  and  buried  oxide  can  be 
realized,  the  performance  of  SOI  MOSFETs  with 
Tsoi  of  3  nm  to  5  nm  can  be  much  higher  than  that 
of  bulk  MOSFETs. 
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FIGURE  11  Calculated  phonon-limited  mobility  in  SOI 
MOSFETs  as  a  function  of  SOI  thickness  at  Ns  of  3x  1012  cm-2 
and  9x  1012  cm"2.  The  substrate  impurity  concentration  and  the 
buried  oxide  thickness  are  5x10^  cm-3  and  lOOnm,  respec¬ 
tively. 


SOI  Thickness  [nm] 

FIGURE  12  Calculated  electron  occupancy  of  the  2-fold  and 
the  4-fold  valleys  as  a  function  of  the  SOI  thickness  at  Ns  of 
3xl0-12  cm"2. 
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6.  CONCLUSION 

The  importance  of  the  2D  features  of  the  inversion 
layer  electrons  on  the  current  drive  of  MOSFETs  at 
room  temperature  has  been  re-examined  through 
the  systematic  experiments  and  the  self-consistent 
subband  calculations.  Inversion-layer  mobility  and 
inversion-layer  capacitance  can  be  understood  well 
from  the  viewpoint  of  the  2D  subband  and  the  2D 
carrier  transport.  However,  further  refinements  in 
the  scattering  parameters  and  models  are  still 
needed  to  have  fully-quantitative  description  of 
inversion-layer  mobility.  On  the  other  hand,  satura¬ 
tion  velocity  in  the  inversion  layer  is  considered  to 
be  much  less  affected  by  the  2D  features  of  carriers. 

Based  on  the  knowledge  of  the  Si  subband 
structures,  an  effective  strategy  for  higher  MOS- 
FET  performance,  the  enhancement  of  the  occu¬ 
pancy  of  the  2-fold  valleys  on  a  (100)  surface,  has 
been  presented.  Higher  mobility  of  strained  Si 
MOSFETs  and  SOI  MOSFETs  with  ultra- thin 
SOI  films,  both  of  which  are  the  typical  examples 
of  this  subband  engineering,  has  been  confirmed 
by  the  theoretical  calculations. 
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Monte  Carlo  Simulations  of  Impact  Ionization 
Feedback  in  MOSFET  Structures 
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Although  impact  ionization  feedback  is  recognized  as  an  important  current  multi¬ 
plication  mechanism,  its  importance  as  a  carrier  heating  mechanism  has  been  largely 
overlooked.  This  work  emphasizes  the  inclusion  of  impact  ionization  feedback  in  Monte 
Carlo  device  simulations,  and  its  implications  for  carrier  heating  in  sub-micron  CMOS 
and  EEPROM  technologies. 

Keywords:  Impact  ionization,  feedback,  flash  EEPROM,  secondaries,  distribution  function  tails 


INTRODUCTION 

Monte  Carlo  (MC)  transport  simulation  is  a 
widely  recognized  tool  capable  of  accurately 
obtaining  the  high  energy  tail  of  the  electron 
energy  Distribution  Function  (DF)  responsible  for 
impact  ionization,  oxide  degradation  and  gate 
currents,  /G,  in  MOSFETs.  MC  simulations  and 
measurements  of  these  hot  electron  effects  have 
shown  that  the  oxide  interface  DF  in  MOSFETs  is 
composed  of  two  parts  [1]:  the  conventional 
channel  electron  DF,  and  a  secondary  electron 
DF  coming  from  Impact  Ionization  (II)  FeedBack 
(FB)  [2].  The  channel  electron  DF  is  enhanced  by 
Electron -Electron  (EE)  scattering,  and  at  low 
biases  is  principally  populated  by  this  effect. 
Although  the  secondary  electron  DF,  generated 
by  the  coupled  impact  ionization  of  electrons  and 
holes  is  usually  ignored,  it  can  dominate  the 


channel  DF  tail.  The  relative  importance  of  the 
two  depends  on  the  channel  and  substrate  doping, 
drain  junction  depth,  oxide  thickness  and  most 
importantly,  the  substrate  bias,  FBs-  This  work 
emphasizes  the  inclusion  of  impact  ionization 
feedback  in  Monte  Carlo  device  simulation,  and 
its  implications  for  carrier  heating  in  sub-micron 
MOSFETs.  In  addition,  results  from  a  new  class  of 
EEPROM  devices  based  on  the  II  FB  effect  will  be 
discussed. 


HEATING  BY  II  FEEDBACK  IN  MOSFETs 

In  the  following,  simulations  have  been  performed 
using  the  full-band  MC  transport  simulator,  SMC 
[3],  as  a  post-processor  for  the  device  simulator 
PADRE  [4].  Doping  profiles  and  device  geometry 
are  computed  using  the  2D  process  simulator, 
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PROPHET  [4],  and  electric  fields  are  computed  for 
a  given  bias  condition  by  PADRE.  Then,  given 
these  fields,  SMC  solves  for  the  device  DFs  in  a 
manner  similar  to  [5].  For  more  details,  see  [6]  and 
references  therein.  Also,  in  this  section,  in  order  to 
clarify  the  essential  elements  of  the  II  FB  process 
in  MOSFETs,  EE  scattering  will  not  be  consi¬ 
dered. 

Figure  1  illustrates  the  general  phenomenon  of 
II  feedback  in  an  nMOSFET  [2],  Channel  elec¬ 
trons,  ex,  are  injected  into  the  drain  where  they  II 
forming  low  energy  electron-hole  pairs  with  current 
II  multiplication  M\.  The  secondary  electrons,  e2 
leave  through  the  drain  while  the  secondary  holes, 
h2,  diffuse  to  the  Drain-Substrate  Junction  (DBJ), 
are  heated  by  its  fields  and  are  injected  into  the 
substrate  where  they  II  again  with  multiplication 
M2  forming  e3  and  h3.  The  h 3  holes  leave  through 
the  substrate,  but  the  e3  electrons  fall  back  through 
the  DBJ  and  vertical  gate  controlled  potential 
drops  reaching  the  oxide  interface.  This  process 
continues  with  e3  ionizing  leading  to  a  series  of 
pair  productions  alternating  between  electrons  and 
holes  (II  feedback)  with  multiplications  M3,  M4, 
etc.  Here,  IB  —  Is(\  +  M\  +  MXM2  +  MXM2 
M3  +  . . .).  The  DBJ  is  in  breakdown  when  this 
series  diverges,  but  in  the  following,  the  devices  are 
not  in  breakdown  and  all  the  Mt<  1. 

A  Lch  —  0.25  pm  device  with  strong  II  FB  effects 
has  been  simulated  to  quantify  the  feedback  effect. 


Figure  2  shows  the  potential  energy  along  the 
channel  (from  A  to  B  in  Fig.  4)  and  from  the  drain 
to  the  substrate  (B  to  C)  with  VGS  =  2  V, 
Eds  =  3  V.  Figure  3  shows  the  DF  integrated  over 
the  device  domain.  The  ex  DF  (no  II  FB)  shows  a 
rapid  decay  for  energies  above  the  pinch-off 
potential  energy  drop,  Vp  =  2.5  eV.  There  are  no 
channel  electrons  above  the  oxide  conduction 
band  discontinuity  of  about  A0x  =  3.1eV,  but 
there  are  many  at  the  II  rate  threshold  of  about 
1.1  eV,  so  /B  >  0.  The  e3  contribution  is  also  shown 
in  Figure  3  (II  FB,  full  <fesec).  The  channel 
electrons  ionize  with  Mx  =  0.065,  initiating  the 
FB  process.  The  h2  holes  ionize  with  M2  =  0.02,  so 
that  the  current  carried  by  the  secondary  e3 
electrons  is  quite  small:  IeJIex  =  M\M2  =  0.0013. 
Although  the  DBJ  is  far  from  breakdown,  there  is 
a  broad  tail  extending  to  high  energies  >  Aqx, 
dominating  the  ex  DF  above  Vp.  It  is  sufficient  to 
consider  only  the  e3  secondary  electron  DF  since 
the  DBJ  is  not  in  breakdown. 

The  energies  of  e3  electrons  reaching  the  oxide 
interface  can  be  as  great  as  Efax  =  qV db  +  qV\ bi+ 
£sec,  where  <5£sec  is  the  energy  of  formation  of  the  e3 
electrons  by  the  II  of  the  h2  holes,  and  qVhi  is  the 


Distance  (pm) 


FIGURE  2  Potential  energy  through  LCh  =  0.25  \. im  device 
of  a  0.25  |im  process  from  A-B,  B-C  (Fig.  1). 
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FIGURE  3  DFs  integrated  over  the  device  domain  for  condi¬ 
tions  and  device  of  Figure  2. 


FIGURE  4  DFs  integrated  at  the  oxide  interface  for  condi¬ 
tions  of  Figure  2.  Approximate  energy  thresholds  for  substrate 
currents  (7Br)  and  gate  currents  (7gr)  are  labelled  with  arrows. 


built-injunction  potential.  Typically,  qV^  ~  0.7  to 
0.8  eV.  The  importance  of  6escc  (a  table  generated 
during  the  II  rate  calculation  [7]  from  the  full  Si 
band  structure)  is  clear  comparing  the  full  <fesec 
curve  to  that  computed  with  &sec  =  0  (Fig.  3).  The 
full  curve  extends  to  high  energies  uniformly, 


whereas  the  curve  with  no  energy  of  creation 
begins  to  drop  rapidly  for  energies  near  =  tfFDB  + 
qVbi  =  4eV. 


GATE  CURRENTS  BY  II  FEEDBACK 

It  is  clear  from  above  that  FBS  should  have  a  large 
effect  on  the  tail  of  the  DF.  When  a  negative 
substrate  bias,  FBS  <  0,  is  applied,  changes  in  the 
channel  potential  are  small,  but  qV^  changes 
rigidly  with  FBS  (see  Fig.  2).  Therefore,  ex  heating 
and  /B  are  not  strongly  affected,  whereas  e3  heating 
and  IG  should  change  exponentially.  Figure  4 
shows  the  electron  DF  integrated  along  the  oxide 
interface  for  FBS  =  0,-1  and  —2  V.  The  low  energy 
channel  DF  (below  2.5  eV)  changes  very  little 
( <  2  X)  whereas  the  secondary  tails  responsible  for 
IG  change  exponentially  1000X)  as  FBS  goes 
from  0  to  — 2  V.  This  is  confirmed  experimentally 
in  Figure  5  which  shows  Igr  =  Ig/Is  versus 
Ibk^Ib/Is  for  Fds=FGs  at  various  FBS.  For  a 
given  value  of  /BR  (which  represents  an  equivalent 
channel  heating)  JGr  increases  exponentially  as 
FBs  is  made  more  negative.  This  result  contradicts 
the  “effective  temperature  models”  which  predict  a 
direct  correlation  between  7Gr  and  /BR  [8]  based 
on  the  assumption  that  both  currents  are  gener¬ 
ated  by  one  thermalized  channel  DF.  Figure  5  also 
shows  a  comparison  between  simulated  and 
measured  IGR  and  /BR.  Qualitative  trends  are 
clearly  reproduced  by  the  simulation.  Considering 
the  approximate  treatment  oxide  injection  and 
transport,  the  remarkably  good  quantitative 
agreement  with  experiment  is  more  questionable. 

In  addition  to  enhancing  the  high  energy  tail,  II 
FB  also  changes  the  spatial  distribution  of  the  hot 
carriers.  Figure  6  shows  the  number  of  hot 
secondary  electrons  with  energies  above  the  oxide 
barrier  (3.2  eV)  as  a  function  of  position  along  the 
oxide  interface  (conditions  of  Fig.  4).  As  FBs  is 
made  more  negative,  the  hot  carriers  spread  out 
into  the  channel,  enhancing  LG  when  FGs  is  low. 
For  IG  generation,  the  fields  in  the  oxide  at  the 
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Relative  Substrate  Current  lBR 

FIGURE  5  Measured  and  simulated  /G R  versus  /Br  for  diff¬ 
erent  Fbs  on  a  £ch  =  0.25  pm  device.  Squares,  0.5  V  steps  and 
circles,  0. 1  V  steps  down  from  VDS=VGS  =  4V. 
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FIGURE  6  DFs  integrated  at  the  oxide  interface  for  condi¬ 
tions  of  Figure  2. 


point  of  injection  must  accelerate  the  electrons  to 
the  gate  electrode.  When  FGD<0,  accelerating 
fields  only  exist  in  the  channel  back  towards  the 
source.  The  secondaries,  coming  from  the  sub¬ 
strate,  provide  oxide  injection  at  these  points.  This 
figure  suggests  that  IG  enhancement  by  VBS  is 
stronger  the  smaller  VGS  is  compared  to  FDS.  This 
is  confirmed  in  Figure  7  which  shows  measured 
gate  injection  efficiency,  IGR  versus  VGS  as  a 
function  of  VBS  for  a  0.25  pm  device  [9].  Note  also 
that  for  FBs  =  —3V  there  is  strong  gate  injection 
for  VGS  all  the  way  down  to  the  threshold  voltage 
of  the  device,  1.3  V.  For  FGS,  1.3  V,  IG  will  go  to 
zero  with  the  sub-threshold  drain  current. 

TWO-PART  DISTRIBUTION  FUNCTIONS 

As  noted  in  the  introduction,  EE  scattering  can 
extend  the  channel  DF  to  higher  energies  than  Vp 
and  Fds  [1,  10-11].  This  EE  enhanced  channel 
DF  will  compete  with  the  II  FB  at  high  energies. 
The  relative  importance  of  the  two  depends 
primarily  on  M2  and  T,  factors  which  are  con¬ 
trolled  primarily  by  the  sharpness  of  the  DBJ 


VGs(V) 


FIGURE  7  Measured  IGR  for  different  FBs  as  a  function  of 
FGs  on  a  LCH  =  0-25pm  device. 


junction  and  the  strength  of  the  gate  field.  M2  and 
T  become  large  for  shallow  junction  devices  with 
high  channel  or  junction  doping,  and  as  shown 
above,  negative  FBs-  One  easy  way  to  determine 
the  importance  of  the  two  is  through  sensitivity  to 
FBs.  The  channel  DF  is  fairly  independent  of  FBs, 
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even  when  EE  scattering  is  included,  but  the  II  FB 
tails  are  strongly  affected  by  it.  In  general,  for  a 
hot  electron  effect  with  a  threshold  energy  of  ETH, 
there  will  always  exist  a  FgS  such  that  for 
Fbs  <  FgS,  the  secondaries  will  dominate  the 
channel  DF.  For  IG,  ETH  is  3.2  eV  for  points  on 
the  interface  where  gate  fields  are  favorable  for 
injection.  When  FBs  <  FgS,  dIG/dVBS  will  become 
large.  In  devices  of  older  technology  generations  in 
which  II  FB  effect  tend  to  be  weak,  |  FgS|  can  be  as 
high  as  -7V  [1].  Figure  8  shows  IG  versus  FBs 
measured  on  two  devices.  The  curves  have  been 
normalized  by  the  same  factor  for  both  devices. 
DEVI  is  a  0.25  pm  device  with  strong  FB  effects 
similar  to  that  of  Figure  3,  and  DEV2  is  a 
/.(  ii  =  0.32  pm  device  optimized  for  high  IG  (see 
below).  The  measured  data  was  obtained  by 
measuring  floating-gate  Fjh  shifts,  and  the  simu¬ 
lations  were  performed  with  both  II  FB  and  EE 
scattering.  In  DEVI,  the  FB  DF  begins  to 
dominate  IG  for  FBS  <  -0.5  V  as  shown  by  both 
measurement  and  simulation.  In  DEV2,  the  FB 
DF  controls  IG  even  at  FBs  —  0  V.  Oxide  DFs  for 
DEVI  in  Figure  8  are  shown  in  Figure  9. 


FIGURE  8  Measured  and  simulated  IG  for  two  different 
devices:  DEVI,  similar  to  that  in  Figure  2,  and  DEV2  a 
Lc h  =  0.32  pm  device  optimized  for  high  IG. 
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FIGURE  9  Simulated  interface  DFs  for  different  Fbs  on 
DEVI  for  Figure  7  including  II  FB  and  EE  scattering. 
Approximate  /B  and  IG  thresholds  are  labelled  as  in  Figure  4. 


THE  CHISEL  FLASH  EPROM  CELL 

Nonvolatile  memories  (Flash  EPROMs)  use  hot 
carrier  gate  injection  for  programming.  Previous 
devices  have  relied  on  channel  hot  electron 
injection  for  programming  which  requires  high 
Kgs  (10  V  or  more)  and  FDs  (5-6V)  to  achieve 
adequate  programming  times  of  lOOps  or  faster. 
MC  simulations  and  measurements  like  those 
shown  above  suggested  that  the  II  FB  effect  could 
be  utilized  to  greatly  improve  programming 
performance  for  low  FDs  and  Fgs,  leading  to  a 
lower  power,  better  scaled  EPROM  cell.  Based 
upon  the  physical  understanding  provided  by  these 
simulations,  Flash  EPROM  devices  and  a  writing 
methodology  were  designed  to  optimize  the  II  FB 
process  to  produce  a  CHISEL  (channel  initiated 
secondary  electron)  EPROM  cell  [9]. 

Simulation  optimization  of  the  cell  concentrated 
on  the  following:  heavily  doped  junctions  increase 
substrate  current,  hole  multiplication,  M3  and  e3 
electron  transmission  to  the  gate,  T\  shallow  junc¬ 
tions  enhance  M\  and  T\  and,  Fbs  <  0  enhances  M2 
and  T.  CHISEL  memory  devices  (Fig.  10)  were 
fabricated  by  forming  a  stacked  gate  on  nominal 
0.25  pm  nMOS  devices  exhibiting  thin  oxides  and 
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shallow  junctions  (tox  =  6  nm,  xj  =  75  nm).  Because 
this  device  is  based  on  a  fully  scaled  CMOS 
technology,  it  is  ideal  for  scaling  into  the  deep 
submicron  regime.  Boron  halos  were  added  to 
increase  junction  doping,  hence,  M2  and  T.  Most 
importantly,  a  VBS  <  0  writing  scheme  was  adopted. 

Figure  11  shows  CHISEL  cell  programming 
transients  for  FDS  =  -FBS  =  2.5  and  3  V  and  a 
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FIGURE  10  Stacked-gate  CHISEL  EEPROM  device  struc- 
ture. 


FIGURE  11  Measured  programming  transients  for  a  CHI¬ 
SEL  EEPROM  cell. 


vBS(V) 


FIGURE  12  Time  to  reach  a  vth  change  of  IV  during  pro¬ 
gramming  of  a  CHISEL  EEPROM  cell. 


control  gate  bias  of  4  V;  efficient  programming  for 
times  less  than  100  ps  can  be  attained  with  biases 
much  lower  than  standard  Flash  devices  require. 
Note  also  that  the  cell  has  strong  programming 
until  it  turns  itself  (high  IG  with  floating  gate 
potentials  down  to  the  cell  threshold  voltage  -  Fig. 
7).  Figure  12  highlights  the  strong  writing  time 
dependence  on  FBs  as  anticipated. 

CONCLUSIONS 

Physically  based  MC  transport  simulation  have 
clarified  the  importance  of  II  FB  and  the  two-part 
nature  of  the  high  energy  DF  tails  in  sub-micron 
MOSFETs,  showing  good  agreement  with  experi¬ 
ment.  This  knowledge  has  led  to  the  creation  of  a 
new  class  of  scalable  low  voltage,  low  power 
EPROM  devices. 
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Inclusion  of  Quantum  Confinement  Effects 
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The  design  of  Monte  Carlo  FET  simulations  is  discussed,  with  specific  attention  to  the 
methods  used  to  describe  quantum  confinement  effects.  A  new  model  is  presented, 
which  employs  self-consistent  coupling  of  Schrodinger,  Poisson  and  Monte  Carlo 
algorithms,  and  explicit  calculation  of  the  scattering  rates  between  confined  and 
unconfined  states.  Comparisons  between  the  new  model  and  a  standard  semi-classical 
Monte  Carlo  model  are  presented  for  a  0.1  pm  gate-length  Ino.52Alo.48As/Ino.53 
Ga0.47As/InP  MODFET.  Whilst  the  quantum  model  yields  minor  corrections  in  the 
predicted  output  characteristics,  it  is  found  that  these  results  can  be  achieved  without 
repeated  iterations  of  the  Schrodinger  equation. 


Keywords:  Monte  Carlo,  HEMT,  MODFET,  quantum 


1.  INTRODUCTION 

The  methodology  for  developing  semiconductor 
device  simulations  using  Monte  Carlo  algorithms 
is  now  reasonably  mature.  The  scheme  of  coupling 
an  Ensemble  Monte  Carlo  particle  simulation  to  a 
2-dimensional  (2D)  Poisson  Solver  has  been 
successfully  applied  to  both  FETs  and  HBTs, 
and  now  represents  the  standard  method  in  this 
subject  area  [1-2].  Numerous  enhancements  to  the 
basic  model  have  been  developed  to  obtain  greater 
accuracy -in  particular  the  use  of  exact  [3]  or 
parameterised  [4]  Tull  bandstructure’  models. 


However,  the  standard  Monte  Carlo  device 
simulation  is  semi-classical:  electrons  and  holes  are 
modelled  as  discrete  particles,  whilst  scattering 
between  states  is  described  using  quantum  mecha¬ 
nical  perturbation  theory.  It  has  long  been  appre¬ 
ciated  that  in  certain  situations  -  such  as  the 
MOSFET  inversion  layer  and  the  MODFET 
channel  layer  -  quantum  confinement  effects  may 
be  significant.  Inclusion  of  such  effects  requires,  in 
principle,  self-consistent  solution  of  the  2D  Poisson 
and  Schrodinger  equations,  and  the  Monte 
Carlo  transport  algorithm;  a  task  which  appears 
far  too  computationally  expensive  for  practical 
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application.  Consequently,  there  have  been  very  few 
attempts  to  incorporate  quantum  confinement.  In 
this  paper,  we  review  briefly  some  previous  Monte 
Carlo  FET  models  in  which  the  problem  has  been 
addressed,  and  then  discuss  our  own  Monte  Carlo/ 
Poisson/Schrodinger  MODFET  simulator. 

2.  REVIEW  OF  MONTE  CARLO  FET 
MODELS 

Quantum  confinement  effects  were  actually  con¬ 
sidered  in  one  of  the  earliest  Monte  Carlo 
MODFET  models.  Ravaioli  and  Ferry  [5] 
calculated  the  first  two  quantised  energy  levels  in 
the  MODFET  channel  using  a  triangular  well 
approximation.  This  yielded  an  analytic  solution 
parameterised  by  the  perpendicular  (gate-sub¬ 
strate)  electric  fields  thus,  the  energy  level  values 
could  be  easily  updated  when  the  Poisson  equation 
was  re-solved,  and  the  effect  of  potential  variation 
along  the  channel  could  be  included.  The  scatter¬ 
ing  rates  between  quantum  confined  states  were 
parameterised  by  energy  level  separation  and 
carrier  density,  and  hence  could  also  be  updated. 
Clearly,  the  main  shortcoming  of  this  otherwise 
elegant  approach  is  the  validity  of  the  triangular 
well  approximation  these  devices. 

Park  and  Brennan  [6]  developed  a  MODFET 
simulator  which  also  included  solutions  of  the 
Schrodinger  equation  for  the  first  two  subbands  in 
the  channel.  The  eigenvalues  were  obtained  from  a 
numerical  solution  of  the  1 -dimensional  (ID) 
Schrodinger  equation,  whilst  the  scattering  rates 
were  obtained  using  approximate  variational 
wavefunctions.  However,  the  Schrodinger  equa¬ 
tion  was  solved  assuming  an  average  2D  carrier 
density  in  the  channel,  hence  no  position  depen¬ 
dence  was  included. 

A  key  aspect  of  including  quantum  confinement 
in  FET  simulations  is  the  method  used  for  descri¬ 
bing  transitions  between  (quantum)  confined  and 
unconfined  (classical  or  bulk-like)  states.  This 
problem  is,  itself,  and  artifact  of  the  simulation 
model:  in  reality  all  states  can  be  described  using 


quantum  mechanics;  however,  the  use  of  Monte 
Carlo  particle  algorithm  leads  inevitably  to  a 
classical  model  of  electron  kinematics.  In  both 
the  above  models,  electrons  outide  the  channel 
were  modelled  as  semi-classical  particles.  Electrons 
in  the  channel  layer  which  acquired  a  kinetic 
energy  in  excess  of  the  barrier  height  were  also 
‘converted’  into  classical  particles. 

A  similar  scheme  was  used  by  Fischetti  and  Laux, 
in  their  simulation  of  inversion  layer  MOSFETs  [7]. 
Their  simulation  code  includes  Monte  Carlo,  2D 
Poisson  and  ID  Schrodinger  algorithms,  which  are 
solved  self-consistently.  The  scattering  rates  be¬ 
tween  confined  states  are  calculated  using  the  exact 
numerical  wavefunctions.  However,  Fischetti  and 
Laux  also  conclude,  as  above,  that  it  is  not  practical 
to  describe  all  electrons  in  the  device  by  Schrodinger 
eigenfunctions.  Hence  their  model,  too,  includes 
transitions  between  two  alternative  descriptions  of 
electronic  states.  Scattering  rates  were  calculated 
according  to  the  nature  of  the  initial  state:  for 
unconfined  electrons,  rates  for  transitions  between 
pairs  of  bulk-like  states  were  used,  whereas,  for 
electrons  confined  in  the  channel,  rates  for  transi¬ 
tions  between  pairs  of  quantum  confined  states  were 
used,  irrespective  of  the  energy  of  the  final  state. 
Such  a  scheme  again  necessitates  ‘conversion’  of 
confined  electrons  into  unconfined,  and  vice  versa. 
These  ‘conversions’  violate  energy  and  momentum 
conservation,  because  the  quantised  spectrum  of 
states  in  the  channel  has  fewer  degrees  of  freedom 
than  the  continuum  to  bulk  states.  Also,  the 
unconfined  ‘conversion’  process  contained  no 
physics  to  specify  in  which  subband  the  ‘converted’ 
electron  should  reside. 

3.  A  NEW  SELF-CONSISTENT 

SCHRODINGER-POISSON-MONTE 
CARLO  MODEL 

Our  MODFET  model  also  includes  self-consis- 
tently  coupled  Schrodinger,  Monte  Carlo  and 
Poisson  algorithms.  However,  in  an  attempt  to 
overcome  some  of  the  problems  associated  with 
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conversion  between  the  confined  and  unconfined 
descriptions  of  electrons  states,  we  have  designed  an 
alternative  model  of  the  scattering  and  kinematics 
involved  in  electron  transfer  in/out  of  the  channel. 

The  Schrodinger  equation  is  solved  numerically 
in  a  series  of  ID  slices  between  source  and  drain, 
but  self-consistency  between  the  electrostatic 
potential  and  quantum-mechanical  charge  distri¬ 
bution  is  maintained.  The  scattering  rates  between 
confined  states  in  the  MODFET  channel  are 
calculated  at  each  position  along  the  channel  using 
the  exact  numerical  wavefunctions,  and  are  re¬ 
calculated  periodically  throughout  the  simulation. 

Electrons  outside  the  channel  layer  are  modelled 
as  classical  particles.  Within  the  channel  layer, 
both  particle-like  electrons  and  quantum  confined 
electrons  are  permitted.  Quantum  confined  states 
are  calculated  for  the  first  n  subbands  in  the 
channel:  higher  subbands  are  assumed  to  merge 
into  a  continuum,  characterised  by  bulk-like 
states.  Transitions  between  confined  and  conti¬ 
nuum  states  are  described  solely  by  scattering 
processes.  For  confined-continuum  scattering,  the 
final  state  is  described  by  a  wavepacket  which  has 
a  single  k-value,  a  constant  amplitude  within  the 
channel  layer,  and  zero  amplitude  outside.  A  bulk¬ 
like  density  of  final  states  is  assumed.  After  such  a 
scattering  event,  the  final  state  is  modelled  as  a 
classical  particle  with  a  well  defined  energy  and 
wavevector:  its  real  space  co-ordinate  (in  the 
direction  prependicular  to  the  heterolayers)  is 
chosen  randomly  within  the  channel. 

Conversely,  for  any  particle  which  enters  the 
channel  there  is  a  non-zero  probability  of  scattering 
into  a  confined  state.  Transition  rates  are  calculated 
using,  for  the  initial  state,  a  wavepacket  as 
described  above,  and  a  Schrodinger  eigenfunction 
for  the  final  state.  The  final  density  of  states  is  that 
appropriate  to  the  quantum  confined  subband. 

Transitions  between  confined  and  continuum 
states  are  mediated  by  all  possible  interactions: 
acoustic  and  polar  optical  phonon,  and  alloy 
disorder  scattering.  Whilst  this  approach  avoids 
some  of  the  difficulties  with  energy  and  momentum 
conservation  discussed  above,  we  have  clearly 


deployed  new  approximations,  such  as  the  as¬ 
sumption  of  a  contiunuous  spectrum  of  states  and 
the  form  of  the  continuum  wavepacket.  Our  choice 
of  wavepacket  function  appears  to  best  reflect  the 
bulk-like  form  of  incoming  and  outgoing  particles. 
No  confinement  effects  are  included  for  the  X  and 
L  valleys,  and  scattering  rates  from  confined  T 
states  to  the  (unconfined)  X  and  L  valleys  are 
calculated  explicitly.  In  practice  we  have  included 
just  the  lowest  three  subbands  in  the  channel:  a 
fourth  subband  can  be  included  without  too  much 
computational  expense,  but  its  population  is 
invariably  small,  and  thus  will  have  little  effect 
on  the  simulation  results. 

4.  SIMULATION  PERFORMANCE 

To  test  our  simulation,  we  chose  for  simplicity  a  3- 
layer  MODFET  geometry.  The  materials  system 
was  lattice  matched  Ino.52Alo.48As/Ino.53Gao.47As/ 
InP,  which  has  large  conduction  band  offsets  and 
is  therefore  more  likely  to  be  affected  by  quantum 
confinement.  No  cap  layer  was  included  in  the 
simulation:  injection  and  collection  were  modelled 
using  ohmic  source  and  drain  regions  which 
spanned  both  the  supply  and  channel  layers  (see 
Fig.  1). 

Our  first  objective  was  to  ascertain  the  impor¬ 
tance  of  quantum  confinement  by  comparing 
results  of  our  new  code  with  those  from  our 
semi-classical  Monte  Carlo  model.  On  a  micro¬ 
scopic  level,  we  may  expect  two  principal  differ¬ 
ences:  in  the  distribution  of  charge  across  the 
channel  width,  and  in  the  electron  scattering  rates. 
Figure  2  shows  a  comparison  of  the  scattering 


Schottky  Gate 


FIGURE  1  Schematic  diagram  of  the  simulated  0.1  jxm  gate- 
length  InP  MODFET. 
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FIGURE  2  Full  curve:  total  (phonon  plus  alloy)  scattering 
rate  for  an  electron  in  the  lowest  subband  in  the  channel.  (The 
impurity  scattering  rate  was  not  included  in  this  diagram  since, 
although  the  rates  (at  low  electron  energies)  are  large,  the  effect 
of  impurity  scattering  is  relatively  small,  as  it  gives  no  energy 
relaxation  and  relatively  little  momentum  relaxation).  The 
electron  wavefunctions  were  calculated  at  the  mid-point  of  the 
channel,  for  gate  and  drain  biases  of  KG  =  0.2V,  KD  =  1.0V. 
Broken  curve:  scattering  rate  calculated  in  the  standard  semi- 
classical  model. 

rates  for  the  two  models.  The  rates  are  very  similar 
at  high  energies  (the  shoulder  at  E&Q.65  eV 
marks  the  onset  of  intervalley  scattering,  which 
has  identical  rates  in  both  models).  At  low  energies 
the  total  rate  in  the  quantum  model  exceeds  that  in 
the  classical  model,  due  to  the  difference  in  the  2D 
and  3D  densities  of  states. 

Figure  3  shows  the  charge  distrubution  at  the 
source  end  of  the  channel  in  the  quantum  and 
classical  simulations.  The  delocalised  nature  of  the 
electron  wavefunction  results  in  a  broader  charge 
distribution  in  the  quantum  model,  with  much  less 
charge  accumulation  at  the  channel/supply  layer 
interface  (d=200  A)-  where  the  channel  potential 
is  lowest. 

In  Figures  4-6  we  examine  the  extent  of 
channel  non-uniformity.  Figure  4  shows  typical 
potential  profiles  taken  from  the  source,  gate  and 
drain  regions  of  the  device.  Figure  5  shows  the 
wavefunctions  for  the  first  eigenstate  at  the  same 
three  locations.  There  is  very  little  difference  in  the 
wavefunctions  in  the  source  and  drain  regions, 
despite  the  much  lower  potential  at  the  drain  end 


FIGURE  3  Electron  distribution  across  a  vertical  slice  in  the 
source  region  of  the  channel.  The  horizontal  scale  measures 
distance  from  the  top  of  the  InAlAs  supply  layer  downwards 
towards  the  substrate,  with  the  channel  located  between 
200 -350  A. 


FIGURE  4  Potential  (conduction  band-edge)  profiles  along 
vertical  slices  taken  in  the  source,  gate  and  drain  regions 
respectively.  Horizontal  scale  as  in  Figure  2. 


of  the  device,  but  the  peak  in  the  wavefunction 
under  the  gate  is  shifted,  due  to  the  altered  shape 
of  the  channel  potential  well.  However,  Figure  6 
shows  that  the  shift  in  wavefunction  position  leads 
to  only  a  few  percent  difference  between  the 
confined -confined  scattering  rates  at  different 
positions  along  the  channel.  For  this  bias  config¬ 
uration  the  same  observations  apply  to  scattering 
from  higher  subbands  too:  thus  it  appears  that 
channel  non-uniformity  has  relatively  little  effects 
on  scattering. 
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FIGURE  5  Wavefunctions  of  the  lowest  eigenstate  in  the 
source,  gate  and  drain  regions  of  the  device:  KG  =  0.2V 
FD=1.0V. 


FIGURE  7  Ground-state  wavefunction  at  the  source  end  of 
the  channel,  obtained  using  minimum  and  maximum  updates 
(see  text).  FG  =  0.2V,  KD  =  1.0V. 


FIGURE  6  Confined -confined  (intrasubband)  polar  optical 
phonon  scattering  rate  for  an  electron  in  the  lowest  subband,  in 
the  source,  gate  and  drain  regions  of  the  channel.  KG  =  0.2V, 
Fd=1.0V. 


Energy  (eV) 

FIGURE  8  Confined -confined  (intrasubband)  polar  optical 
phonon  scattering  rate  for  an  electron  in  the  lowest  subband,  at 
the  source  end  of  the  channel,  obtained  using  minimum  and 
maximum  updates.  FG  =  0.2V,  KD=1.0V. 


The  main  computational  overhead  in  a  self- 
consistent  simulation  is  the  frequency  at  which  the 
relevant  parameters  must  be  re-calculated.  In 
Figures  7  and  8  we  compare  wavefunctions  and 
scattering  rats  from  simulations  with  minimum 
and  maximum  self-consistency.  In  both  cases  the 
simulation  was  run  using  the  classical  model,  until 
a  steady  state  was  attained.  In  the  minimum 
update  case,  the  Schrodinger  equation  was  then 
solved  once,  at  each  position  along  the  channel, 
using  a  time-averaged  potential.  The  channel 


scattering  rates  were  calculated  using  the  Schro¬ 
dinger  wavefunctions.  The  simulation  was  then 
continued,  using  the  quantum  model,  but  with  no 
re-calculation  of  the  channel  eigenstates  or  scatter¬ 
ing  rates.  In  the  maximum  update  case,  the 
simulation  was  continued,  re-solving  the  Schro¬ 
dinger  equation  at  every  Poisson  timestep.  The 
channel  scattering  rates  were  calculated  using  the 
original  set  of  eigenstates,  and  then  on  a  further 
five  occasions  during  the  simulation,  using  time- 
averaged  eigenstates  in  each  case. 
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Drain  Voltage  (V) 

FIGURE  9  Simulated  output  characteristics  for  the  0.1pm 
gate-length  InP  MODFET  using  the  classical  and  quantum 
models. 

In  practice,  we  found  surprisingly  little  differ¬ 
ence  between  the  steady-state  wavefunctions  for 
the  two  cases.  Figure  7  is  typical  of  all  the 
comparisons  we  examined,  for  wavefunctions  at 
different  positions  and  for  different  subbands. 
Correspondingly,  the  scattering  rates  show  little 
variation  with  update  frequency;  again,  Figure  8  is 
typical  of  the  various  rates  we  examined  for  this 
device  geometry  and  biasing. 

Figure  9  shows  that  the  quantum  model  predicts 
a  reduced  drain  current  throughout  the  bias  range: 
this  is  caused  by  a  reduction  in  the  predicted 
electron  velocities,  which  is  partly  due  to  the 
greater  phonon  scattering  (as  discussed  above). 
However,  it  must  also  be  appreciated  that  the 
quantum  model  includes  remote  impurity  scatter¬ 
ing,  a  concept  which  does  not  feature  in  the 
classical  description,  and  which  provides  some 
additional  momentum  relaxation  at  low  energies. 
Despite  the  difference  in  charge  distribution  in  the 
quantum  and  classical  models,  the  overall  electron 
density  in  the  channel  is  very  similar. 


5.  CONCLUSIONS 

Our  aim  in  this  work  has  been  to  examine  the 
importance  of  including  quantum  confinement 
effects  in  Monte  Carlo  MODFET  simulations. 


Clearly,  we  have  yet  to  cover  the  full  parameter 
space  for  such  devices:  however,  our  results  to  date 
imply  that  the  inclusion  of  confined  electron  states, 
and  of  quantum  confined  scattering  rates,  has  only 
a  modest  effect  on  predictions  of  device  perfor¬ 
mance,  relative  to  those  obtained  from  standard 
semi-classical  Monte  Carlo  models.  Whilst  the 
quantum  model  predicts,  in  general,  a  broader 
charge  density  distribution  across  the  channel  than 
does  the  calssical  model,  the  overall  charge  density 
in  the  channel  is  largely  unchanged.  Furthermore, 
it  appears  that  the  corrections  predicted  by  the 
quantum  model  can  be  obtained  without  recourse 
to  a  fully  self-consistent  Schrodinger/Poisson/ 
Monte  Carlo  scheme:  once  a  steady  state  solution 
is  obtained  from  the  classical  model,  the  quantum 
correction  can  be  obtained  by  a  single  calculation 
of  the  channel  eigenstates  and  corresponding 
wavefunctions.  This  represents  a  substantial  sav¬ 
ing  in  CPU  time. 
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The  two-dimensional  multi-layered  HEMT  is  modelled  isothermally  by  solving  the 
Poisson  and  current  continuity  equations  consistently  with  the  Schrodinger  equation.  A 
multigrid  method  is  used  on  the  Poisson  and  current  continuity  equations  while  the 
electron  density  is  calculated  at  each  level  by  solving  the  Schrodinger  equation  in  one¬ 
dimensional  slices  perpendicular  to  the  layer  structure.  A  correction  factor  is  introduced 
which  enables  relatively  accurate  solutions  to  be  obtained  using  a  low  number  of 
eigensolutions.  A  novel  method  for  discretising  the  current  density  which  can  be 
generalised  to  the  non-isothermal  case  is  described.  Results  are  illustrated  using  a  two 
layer  AlGaAs-GaAs  HEMT. 


Keywords:  HEMT,  Schrodinger,  Poisson,  current-continuity,  energy  transport,  Bernoulli  function, 
multigrid,  GaAs,  AlGaAs 


1.  INTRODUCTION 

This  paper  addresses  the  problem  of  solving  the 
modelling  equations  for  High  Electron  Mobility 
Transistors  (HEMTs)  in  which  the  Schrodinger 
equation  is  solved  explicitly.  A  basic  4-layer 
structure  consisting  of  layers  of  AlMGai_MAs  with 
different  proportions  of  mole  fraction  u  of  alumi¬ 
nium  content  is  shown  in  Figure  1  with  the  contacts 
arranged  along  the  x-axis  and  with  the  j-axis 
perpendicular  to  the  layer  boundaries.  Such  devices 
have  been  shown  to  operate  up  to  213  GHz  [1]. 


Fast  operation  is  achieved  because  the  electrons 
near  the  layer  interfaces  (the  “2-dimensionar 
electrons)  are  confined  in  narrow  potential  wells 
and  suffer  little  scattering.  The  equations  govern¬ 
ing  the  modelling  of  the  device  are  the  Schrodin¬ 
ger,  Poisson,  current  continuity,  and  the  energy 
transport  equations.  In  this  paper  we  will  apply 
some  new  techniques  to  the  isothermal  case, 
although  they  may  be  easily  generalised  to  the 
non-isothermal  case. 

Many  attempts  have  been  made  to  obtain  robust 
algorithms  and  rapid  numerical  solutions  of  the 
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FIGURE  1  A  cross-section  of  a  four-layer  HEMT  showing 
the  positions  of  the  contacts  and  the  layer  structure. 


equations  which  model  the  devices  [1-13].  A 
special  function  (the  C-function)  has  been  devel¬ 
oped  [2]  which  allows  both  the  current  continuity 
and  energy  transport  equations  to  be  discretised  in 
a  consistent  manner.  A  multigrid  method  [3]  has 
been  developed  and  applied  to  a  one  dimensional 
model,  and  a  more  efficient  method  of  including 
the  eigensolutions  of  the  Schrodinger  equation  has 
been  developed  [4]. 

Figure  2  shows  the  band  offset  structure  between 
different  layers.  This  structure  will  depend  on  the 
proportion  u  of  Al  in  the  layer  AlwGa2_wAs.  The 
equations  to  be  solved  are: 

(i)  The  Poisson  equation 

V  •  (e0erViP)  =  -q(N+  - n-T~ )  (LI) 

for  the  solution  of  the  electrostatic  potential  -0. 


Here,  q  is  the  magnitude  of  the  electron  charge,  n  is 
the  total  electron  density,  and  e0  is  the  permittivity 
of  free  space.  Writing  EF=—q<j)  as  the  quasi  Fermi 
level,  (j)  is  fixed  at  the  bias  values  on  the  contacts. 
The  values  of  ^  on  the  source  and  drain  are  given 
by  putting  the  electron  density  equal  to  the  doping 
density  using  the  Fermi  function  expressions 
described  below.  We  put  =  Eh-(f)b~(j)  on  the 
gate  where  (j)b  =  -0.7  V  is  the  built-in  potential. 

(ii)  In  the  steady  state  we  take 

V-J  =  0,  J  =  —  qfmVcj)  (1.2) 

for  the  current  density  J.  Generally  the  mobility  /i 
may  depend  on  the  field. 

(iii)  The  Schrodinger  equation 

—  ~2  ^  ^  ^  "b  ( Vxc  Eh  —  q*jj)tii  =  A,£, 

(1.3) 

where  ^  is  the  electrostatic  potential  and  and  Ay 
(z  =  0, 1,2,. . .)  are  the  energy  eigenfunctions  (nor¬ 
malised)  and  eigenvalues  respectively.  The  forms 
of  the  exchange  correlation  energy  Vxc,  effective 
mass  m,  relative  permittivity  en  conduction  band 
discontinuity  Eh,  N J  and  T~  are  taken  from 
Adachi  [5].  The  form  of  the  kinetic  energy 
operator  (— S2/2)V  •  (l/m)V  is  taken  for  its  he- 
rmitian  qualities  [14,  15].  The  full  two-dimen¬ 

sional  solution  of  the  Schrodinger  equation  is  very 
time  consuming,  and  in  practice  is  solved  in  one 
dimensional  slices  perpendicular  to  the  layer 
structure  (that  is,  in  the  ^-direction).  We  therefore 
solve  the  one  dimensional  equation 


Cl 

Eh 

E3l 

E32 

Evl  ""  "  " . 

AlGaAs 

GaAs 

FIGURE  2  The  band  offset  structure  between  different 
layers. 


(1.4) 

by  imposing  the  boundary  conditions  £,(0)  = 

6(7)  =  0(i  =  0, 1,2,...). 

The  electron  density  n  inside  the  quantum  wells 
will  be  given  in  terms  of  the  eigensolutions  of  this 
equation.  The  total  electron  density  is  given  by 
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n  =  n2  +  n3  where  n2  (for  the  “2-dimensional” 
electrons)  is  the  contribution  from  the  sub-bands 
given  by  the  solution  of  the  Schrodinger  equation 
and  n3  (for  the  “3-dimensional”  electrons)  is  the 
bulk  electron  density.  We  first  choose  a  maximum 
number  L  of  eigensolutions  of  the  Schrodinger 
equation  with  which  to  work.  Outside  the  poten¬ 
tial  well  defined  by  \l-i  <  Ec  we  have  n2—  0  and 

n3  =  NciF{^(EF-Ec)j  (1.5) 

where  Nc 3  =  2(2ivnkT/ h2)X  5 . 

Inside  the  well  we  must  avoid  the  double 
counting  of  contributions,  and  thus  we  take  (for 
the  one  dimensional  Schrodinger  solution  only) 

«2  -  Nc2  Uy)\2  ln(l  +  e^kr^-Xl))  (1-6) 

i=0 


2.  SOLUTION  OF  THE  EQUATIONS 


On  writing  a  =  (1  /kT)(Ef  -  Xl-i)  and  b  =  (1/ 
kT){\L-\  -  Ec),  Equation  (1.7)  becomes 


«3  =  Nc  3 


1  +  ex~a 


(2.1) 


It  is  this  integral  that  we  wish  to  approximate. 
Now  define  a  function  c{a,  b)  such  that 


n3  =  Nc3{Fk(a)  +  c{a,b)  ln(l+0).  (2.2) 

It  has  been  shown  [4]  that,  within  the  working 
parameter  range  of  the  variables  given  typically  by 
-8.0  <a< 8.0,  0  <  b  < 8.0,  the  function  c(a,  b )  can 
be  approximated  by 

c{a ,  b)  =  (0.782  -  0.023a)\4.  (2.3) 


where  Nc2  =  AirnikT/h 2,  and 

_ 1  /8m^  (<»  (E-Ecf 

”3_27rU2J  At_,l+exp((lAT)(E-EF)) 

_  2  /°°  (x+((l/kT)(XL^-Ec)f 

~  c3V*Jo  l+exp{x-{l/kT)(EF-\L-i)) 

(1.7) 

Inside  the  well  the  integral  in  Equation  (1.7) 
involves  two  parameters  (l/kT)(\L-\  —  Ec)  and 
(l/kT)(Ef-  and  its  evaluation  requires 

considerable  computing  time.  An  approximation 
to  this  integral  has  been  developed  [4]  which 
speeds  up  the  calculation  by  allowing  a  relatively 
small  number  of  eigensolutions  to  be  used.  Outside 
the  well  the  electron  density  is  given  in  terms  of  the 
Fermi  function  F(  i/2)  only,  and  the  approximation 
of  Bednarczyk  and  Bednarczyk  [16]  is  used  for  the 
evaluation. 

In  the  following  section  we  briefly  describe  this 
correction  process  and  introduce  a  new  method  of 
writing  the  current  density  J  so  that  the  Bernoulli 
function  method  may  be  applied  to  the  solution  of 
the  current-continuity  equation.  The  final  section 
describes  the  results  of  a  two-dimensional  simula¬ 
tion. 


Hence  the  value  n3  inside  the  well  may  be  rapidly 
calculated  using  Equation  (2.2)  with  the  value  of  c 
given  by  Equation  (2.3).  It  was  also  shown  that  if 
this  correction  factor  is  used  (rather  than  taking  its 
value  to  be  zero)  then  the  number  of  eigensolu¬ 
tions  needed  for  the  calculation  of  n2  may  be 
reduced  to  give  good  accuracy.  We  now  describe  a 
novel  discretisation  of  the  current-continuity  equa¬ 
tion.  In  the  isothermal  case  we  may  write 


J  —  —qimV<j)  (2.4) 


where  Ep  —  —q(j),  and  the  electron  density  is  a 
function  of  ?/>,  <f>9  x  and  y\  n  =  (j),  x ,  y ).  Firstly 

consider  the  x-component  of  Equation  (2.4). 
Keeping  y  constant  we  have 


dn 

d'tp 

~dn 

~d(f> 

dn 

1 

'dn 

dx 

y 

dx 

y 

d'lp 

+ 

4>,x,y 

dx. 

y 

p<t>. 

+ 

fax# 

dx 

which  we  write  more  simply  as 


dn 

dx 


dip  d(j> 

=  S"*  +  a^  +  "“ 


when  it  is  clear  which  variables  are  being  kept 
constant.  Substitution  of  the  expression  for  d(j>/dx 


32 


E.  A.  B.  COLE  et  al 


into  Equation  (2.4)  gives  the  first  component  of  J 
as 


J(x) 


( !tt 

\  dx 


nx\  n  dn 

—  ]n  —  qii — — . 
)  riff,  dx 


(2.5) 


In  the  interval  (xf  <  x  <Xi+\,y  =  yj)  define  the 
quantity 

Then  in  this  interval  we  can  approximate  Equation 
(2.5)  by 


,  df  dn 

J(x)-adin+0te 


where  a  =  qfxn^/n^  and  0=  —  q/m/n^,.  We  invoke 
the  Bernoulli  function  approach  [2]  to  this 
discretisation  by  assuming  that  0,a/0  and  df/dx 
are  constants  in  the  interval.  Then 


(2.6) 


where  B(x)  =  x/(ex- 1)  and  Ai=jc/+1-ac/.  The  y- 
component  may  be  similarly  discretised  by  intro¬ 
ducing  a  function  g(xhy)  =  t/j  +  (n/n^)iJ+^lnn 
which  is  defined  in  the  interval  (jc  =  xt,  y}  <  y  < 
yj+ 1 ).  Assumptions  similar  to  the  above  have  been 
derived  elsewhere  [2]  and  as  a  brief  indication  of 
their  validity,  note  that  in  the  non-degenerate  case 
we  have  a/0  =  —q/kT  when  n  is  given  in  purely 
exponential  terms. 

The  above  formulation  is  general  in  that  the 
exact  functional  form  of  the  electron  density  is  not 
specified.  Once  the  specification  is  given  then  the 
derivatives  n^,  n$,  nx  and  ny  may  be  calculated.  In 
the  non-isothermal  case,  Equation  (2.4)  will  not 
apply  and  the  generalisation  of  Equation  (2.5)  will 
contain  extra  terms  dn/dT.  The  solution  will  then 


be  given  using  the  C-function  approach  [2]  which 
generalises  the  Bernoulli  function  approach. 

In  order  to  reduce  the  computational  effort  in 
reaching  a  solution,  a  multigrid  approach  has  been 
developed  [3]  in  which  the  Poisson  and  current- 
continuity  equations  are  solved  using  multigrids 
while  the  Schrodinger  equation  is  solved  by  a  non¬ 
multigrid  scheme  at  each  level  to  provide  the 
expression  for  the  electron  density  n.  This  has 
increased  the  speed  of  the  solution  by  almost  five 
times  in  some  cases. 


3.  SIMULATION  RESULTS 

The  model  problem  is  that  of  a  two-layer  Al0.3 
Gao.7As-GaAs  HEMT.  The  ends  of  the  source, 
gate  and  drain  were  at  x  =  0.0pm,  0.1  pm,  0.4  pm, 
0.7  pm,  1.3  pm  and  1.4  pm,  the  total  depth  of  the 
device  was  Y=  0.3  pm  with  the  layer  interface  at 
7  =  0.1  pm.  The  doping  of  the  AlGaAs  and  GaAs 
was  taken  as  5xl023  m~3  and  lxlO19  m“3  respec¬ 
tively.  High  doping  regions  of  1.2x10  24  m-3  were 
taken  around  the  source  an  drain  to  simulate  ohmic 
contacts,  and  a  depletion  value  of  8x  1016m~3  was 
taken  on  the  contact  edges  between  source  and  gate 
and  gate  and  drain  to  pin  Ec  to  a  value  at  which  the 
quantum  well  could  form.  The  electron  temperature 
was  taken  as  a  constant  300  K.  The  mobilities  of  the 


FIGURE  3  Results  of  the  simulation  with  Vgs=  -0.1  V  and 
Vds  =  2.0  V,  with  all  distances  in  ^im,  showing  (a)  the  conduction 
band  Ec ,  (b)  the  electron  density  n,  and  c)  the  second  energy 
eigenfunction  fj. 
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FIGURE  3  (Continued). 


AlGaAs  and  GaAs  were  taken  as  0.2  m2V_1s_1  and 
0.8  m2V_1s_1  respectively. 

Results  shown  in  Figure  3  were  derived  for  the 
case  Fgs  =  -0. 1 V,  Vds  =  2.0  V.  A  multigrid  method 
was  used  to  solve  the  Poisson  and  current 
continuity  equations.  The  Schrodinger  equation 
was  solved  on  each  grid  by  a  non-multigrid 
method  to  provide  the  electron  density.  The 
eigenvalues  were  found  using  a  QL  algorithm  with 
implicit  shifts,  ordered,  and  the  first  L  eigenfunc¬ 
tions  found  by  back  substitution.  The  multigrid 
method  described  in  [3]  was  used.  Four  uniform 
grids  were  taken  with  the  coarsest  grid  being 
40x25  -  this  was  found  to  be  the  minimum  size  for 
rapid  calculation  of  the  exact  solution  on  the 
coarsest  grid  without  losing  the  quantum  well 
structure  in  the  y-direction.  The  coarse  grid 


solution  was  found  using  Newton  iteration  with 
automatic  stopping  based  on  the  sizes  of  the 
residuals  of  the  Poisson  and  current  continuity 
equations.  It  was  found  that  an  initial  number  of 
20  iterations  had  to  be  performed  on  this  solution 
without  quantum  correction  to  allow  the  quantum 
well  structure  to  become  established.  Ten  pre-  and 
post-smoothing  iterations  were  made  on  the  finer 
grids.  As  found  earlier  [4]  it  was  found  that  using 
only  three  eigenvalues  (L  =  2)  with  the  correction 
factor  of  Equation  (2.3)  provided  comparable 
results  with  those  of  the  benchmark  case  L  =  9 
with  no  correction  used,  thus  speeding  the 
numerical  solution.  Further,  solutions  could  be 
found  for  fixed  values  of  Vgs  and  Fds  without 
having  to  approach  them  in  small  increments. 
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MOMENTS:  The  Modular  Monte  Carlo 
Environment  for  Charge  Transport  Simulation, 
Overview  and  Applications 
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We  present  MOMENTS,  a  newly  developed  software  library  for  Monte  Carlo  simulation  of 
semiconductor  devices.  This  library  uses  object-oriented  design  principles  to  provide  a  flexible, 
extensible  toolset  that  allows  rapid  development  of  a  wide  variety  of  Monte  Carlo  simulation 
applications.  It  allows  concurrent  simulation  of  multiple  particle  species  (e.g.  electrons  and  holes) 
with  arbitrary  user-defined  interactions  between  species  (e.g.  generation  -  recombination  and 
carrier  -  carrier  scattering)  in  arbitrary  geometries  using  either  analytic  or  numerical  bandstructure 
representations.  The  modular  design  allows  virtually  all  simulation  parameters  to  be  freely  varied 
across  the  simulation  domain.  MOMENTS  also  takes  advantage  of  the  parallelism  inherent  in  the 
ensemble  Monte  Carlo  approach,  employing  a  scheme  that  can  support  a  wide  variety  of  parallel 
architectures  with  active  load  balancing.  To  demonstrate  some  of  the  library’s  capabilities,  we  also 
present  preliminary  results  from  a  GaAs  avalanche  photodiode  (APD)  simulator  based  on 
MOMENTS. 

Keywords:  Simulation,  Monte  Carlo,  object-oriented,  semiconductors,  transport,  parallel, 
avalanche  photodiode 


INTRODUCTION 

Conventionally,  applications  of  the  Monte  Carlo 
method  to  semiconductor  device  simulation  have 
come  in  the  form  of  highly  specialized  codes 
designed  specifically  to  address  a  relatively  con¬ 
strained  and  well-defined  set  of  research  questions. 
The  scope  of  these  specialized  tools  has  been 
further  limited  by  the  tremendous  computational 
demands  of  Monte  Carlo  simulation.  However,  as 
both  conventional  silicon  and  optoelectronic 
device  designs  continue  to  scale  dimensionally, 
the  Monte  Carlo  method,  with  its  unmatched 
ability,  to  accurately  characterize  the  nonlocal 
transport  phenomena  that  dominate  the  behavior 


of  devices  with  very  small  geometries,  will  become 
an  increasingly  attractive  simulation  option.  In 
order  to  satisfy  the  demands  of  the  TCAD 
community,  future  Monte  Carlo  simulators  should 
possess  three  key  properties:  flexibility  -  allowing 
the  code  to  be  easily  adapted  to  explore  a  variety 
of  devices,  geometries,  and  materials;  extensibil¬ 
ity  -  permitting  the  user  to  add  new  models  and 
capabilities  to  the  simulation  package  as  needed 
without  requiring  modification  of  existing  code; 
and  performance  -  so  that  the  simulator  can  be 
used  cost  effectively  on  a  day-to-day  basis. 

We  have  developed  a  new  Monte  Carlo  simula¬ 
tion  environment,  MOMENTS,  that  attempts  to 
address  these  three  key  requirements  by  exploiting 
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advanced  object-oriented  and  parallel  software 
design  in  C  +  + .  Rather  than  attempting  the 
(probably)  impossible  task  of  creating  a  single 
application  capable  of  addressing  every  antici¬ 
pated  research  need,  we  have  instead  developed 
MOMENTS  as  a  library  of  objects  that  can  be 
quickly  and  easily  assembled  to  create  a  variety  of 
simulation  applications.  Thus  MOMENTS  can  be 
thought  of  as  a  toolbox  or  application  program¬ 
ming  interface  (API)  for  Monte  Carlo  simulation. 
The  MOMENTS  API  is  designed  to  be  easy  to  use 
but  has  many  sophisticated  features  and  is  general 
enough  to  subsume  most  existing  Monte  Carlo 
models,  allowing  those  models  to  be  leveraged  for 
use  in  new  problems. 

DESIGN  PRINCIPLES 

MOMENTS  comprises  a  collection  of  C+  + 
classes  designed  to  represent  the  functional  com¬ 
ponents  of  a  Monte  Carlo  simulation.  These 
classes  are  organized  hierarchically,  making  full 
use  of  the  inheritance  properties  of  object  oriented 
software.  The  root  level  classes  have  intentionally 
simple  interface  specifications.  This  promotes 
maximum  flexibility,  since  any  child  class  which 
supports  these  very  general  interfaces  can  be  easily 
integrated  into  an  existing  simulation.  Simulations 
are  defined  by  the  interactions  between  objects 
based  on  three  primary  root  level  classes:  SimOb- 
ject,  SimRegion,  and  SimEvent  (see  Fig.  1).  The 
SimObject  class  is  the  heart  of  MOMENTS.  Each 
SimObject  represents  a  self-contained  entity  that, 
based  on  its  surroundings,  knows  how  to  evolve  its 
own  state  over  a  fixed  time  interval.  Classes  to 
represent  electrons  and  holes  are  descendants  of 
SimObject.  In  addition,  the  user  may  define  other 
particle  species  based  on  the  SimObject  class  as 
long  as  their  time-evolving  behavior  can  be  defined 
in  the  context  of  the  simulation.  Possible  examples 
include  photons,  phonons,  excitons,  or  mobile 
ions.  This  makes  MOMENTS  useful  in  a  wide 
variety  of  simulation  applications,  including  pro¬ 
cess  simulation.  Since  SimObjects  are  self-con- 


FIGURE  1  Schematic  showing  the  three  root  functional 
classes  of  MOMENTS,  examples  of  simulation  elements  they 
are  designed  to  represent,  and  their  primary  methods. 

tained,  several  different  particle  species  can  easily 
be  simulated  concurrently  and  new  instances  of 
simulated  particles  can  be  created  “on-the-fly’\ 
This  allows  direct  study  of  multi-species  interac¬ 
tions,  including  direct  simulation  of  generation- 
recombination  processes. 

The  SimRegion  class  is  designed  to  encapsulate 
spatial  structure  data  in  the  most  general  way 
possible;  the  only  requirement  for  SimRegion 
objects  is  that  they  must  determine  whether  or 
not  they  contain  a  given  spatial  coordinate.  The 
SimRegion  class  also  supports  a  hierarchy  in 
which  each  SimRegion  object  can  contain  one  or 
more  sub-regions.  This  allows  structures  of  arbi¬ 
trary  complexity  and  dimensionality  to  be  repre¬ 
sented.  It  also  allows  the  programmer  to  create 
anything  from  bulk  up  through  full  3-D  simula¬ 
tions  without  altering  the  underlying  models  in  the 
code.  SimRegion  objects,  which  comprise  both 
real-space  and  fc-space  structures  (bandstructures), 
also  serve  as  containers  for  data  (e.g.  material 
data,  electric  fields,  etc.)  using  the  SimRegistry 
mechanism  described  below.  This  data  storage  is 
also  hierarchical.  If  a  SimObject  requests  data 
from  a  sub-region  for  which  that  data  has  not  been 
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specified,  the  request  will  automatically  be  passed 
to  that  sub-region’s  parent  region. 

The  SimEvent  class  represents  stochastically 
occurring  events  that  alter  the  state  of  a  SimObject 
(e.g.  scattering  mechanisms).  SimEvents  related  to 
a  given  SimObject  class  are  collected  in  Sim- 
EventGroups,  which  can  automatically  handle 
event  selection  using  an  efficient  adaptive  self¬ 
scattering  technique.  Different  SimEventGroups 
can  be  specified  in  different  regions  of  the 
simulation  domain.  SimEvents  can  also  be  loaded 
into  and  unloaded  from  SimEventGroups  at  will, 
even  while  the  simulation  is  running.  This  allows 
transport  simulators  based  on  MOMENTS  to 
adaptively  select  the  most  appropriate  scattering 
models  for  a  given  portion  of  the  simulation.  It  is 
fairly  straightforward  to  create  new  scattering 
models  or  modify  the  behavior  of  existing  ones 
using  C++  inheritance. 

MOMENTS  derives  its  flexibility  from  this 
modular  design.  Classes  belonging  to  each  of  the 
three  functional  groups  described  above  can  be 
mixed  and  matched  as  needed,  even  during  the 
course  of  a  running  simulation.  MOMENTS  is 
also  fully  extensible;  it  is  relatively  easy  for  users  to 
create  new  SimObject,  SimRegion,  and  SimEvent 
subclasses,  using  the  inheritance  properties  of 
C  +  + ,  and  integrate  them  into  existing  codes. 
The  modular  MOMENTS  structure  encourages 
collaboration  and  simplifies  code  maintenance.  It 
also  provides  potential  performance  benefits.  Since 
virtually  every  aspect  of  the  simulation,  including 
scattering  models,  self-scattering  rates,  bandstruc- 
ture  representations,  and  even  interpolation 
schemes,  can  be  varied  freely  across  different 
regions  of  the  simulation  domain,  the  user  can 
focus  the  greatest  degree  of  computational  effort  in 
regions  of  the  domain  where  the  greatest  accuracy 
is  desired. 


PARALLELISM  IN  MOMENTS 

MOMENTS  addresses  performance  concerns  pri¬ 
marily  by  exploiting  the  opportunities  for  paralle¬ 


lism  inherent  in  the  Monte  Carlo  method.  In  the 
MOMENTS  system,  the  collections  of  SimObjects 
necessary  for  ensemble  Monte  Carlo  simulations 
are  collected  in  one  or  more  SimPopulation 
objects.  The  members  of  a  SimPopulation  can 
then  be  distributed  to  one  or  more  “contexts”  - 
independent  subgroups  of  SimObjects  that  evolve 
in  parallel  under  independent  threads  of  control. 
The  members  of  each  context  are  synchronized 
and  allowed  to  interact  at  predetermined  intervals 
in  a  manner  analogous  to  Bulk  Synchronous 
Parallelism  [1]  (see  Fig.  2).  While  contexts  are 
busy  handling  the  evolution  of  their  respective 
SimObjects,  the  main  thread  is  free  to  perform 
subsidiary  tasks  such  as  re-calculation  of  electric 
fields  using  a  Poisson  solver  or  I/O.  The  MO¬ 
MENTS  context  scheme  is  applicable  to  a  wide 
variety  of  parallel  architectures,  including  hetero¬ 
geneous  distributed  systems,  and  supports  active 
load  balancing.  Such  parallelization  of  the  Monte 
Carlo  method  raises  several  interesting  issues.  For 
example,  the  indeterminate  timing  of  calls  to  a 
given  random  number  generating  sequence  by 
different  contexts  would  tend  to  prevent  reproduc¬ 
tion  of  a  given  sequence  of  events  -  a  useful 
debugging  technique.  MOMENTS  addresses  this 


Main  Thread 


Context  1 


Context  2 


SimPopulation. evolve(At) 

SimContext[1  J.wakeQ 
SimContext[2].wake()  SimObjectjl].  evolve  (At) 

SimObject[2].  evolve  (At)  I 
SimObject[3]  .evolve  (At) 


SimObject[1  ].evolve(At) 
SimObject[2].evolve(At) 
SimObject[3].evolve(At) 


Main  thread  free  to 
perform  subsidiary 
tasks  (I/O,  Poisson, 
etc.) 


Apply  Boundary  Condi¬ 
tions 

Distribute  Newly  created 
SimObjects  to  SimCon- 
texts 


SimObject[N].evolve(At) 

DONE 

sleep() 


REPEAT 


FIGURE  2  Illustration  of  a  simulation  cycle  for  a  SimPopu¬ 
lation  object  with  two  attached  contexts. 
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problem  by  providing  each  context  with  its  own 
random  number  generating  sequence.  Calls  to  the 
SimRand()  function  are  automatically  directed  to 
the  generating  sequence  for  the  calling  thread, 
shielding  the  user  from  this  complexity.  Another 
problem  common  to  all  parallel  codes  is  data 
contention  by  multiple  threads  of  control.  Many 
key  data  structures  in  MOMENTS  have  been 
carefully  designed  to  avoid  problems  caused  by 
data-contention  among  multiple  contexts. 


SUPPORT  CLASSES 

In  addition  to  the  three  main  class  hierarchies 
described  above,  MOMENTS  includes  a  number 
of  support  classes.  Grids  can  be  defined  using  the 
SimGrid,  SimGridPoint,  and  SimGridCell  classes. 
These  grid  classes  are  very  general  and  can  support 
all  types  of  structured  and  unstructured  grids  in 
one,  two,  or  three  dimensions.  Boundary  condi¬ 
tions  are  handled  by  the  SimSource  and  SimSink 
classes,  which  can  emulate  all  types  of  conven¬ 
tional  boundaries.  In  addition,  these  classes  can  be 
used  to  simulate  absorption  (recombination)  or 
emission  (photoemission,  thermal  generation,  etc.) 
throughout  entire  regions  of  the  simulation 
domain.  The  SimRecorder  class  can  be  used  to 
automatically  record  the  characteristics  of  one  or 
more  SimObjects  at  specified  intervals.  MO¬ 
MENTS  is  also  equipped  with  a  variety  of  objects 
to  handle  thread  control  in  parallel  implementa¬ 
tions,  including  a  number  of  thread-safe  data 
structures. 

One  of  the  key  support  classes  is  the  SimReg- 
istry  -  a  construct  that  allows  members  of  a 
“client”  class  to  transparently  create  storage  in 
every  member  of  a  SimRegistry  “host”  class.  The 
client  can  then  address  the  “registered”  data  item 
for  a  given  object  of  the  host  class  using  an  array- 
style  notation.  Storage  for  this  registered  data  is 
automatically  created  in  any  new  objects  based  on 
the  host  class.  The  SimRegistry  concept  is 
important  because  it  allows  users  to  extend  the 


functionality  of  existing  host  classes  without 
actually  having  to  modify  the  source  code  of  the 
host  class.  Also,  any  number  of  client  classes  can 
use  the  same  host  class  for  storage.  SimRegion, 
SimGridPoint,  SimBandStruct,  and  SimThread 
are  all  examples  of  SimRegistry  host  classes. 


GAAS  APD  SIMULATION 

As  an  example  of  the  capabilities  of  MOMENTS, 
we  present  preliminary  results  from  a  GaAs 
avalanche  photodiode  (APD)  simulator  based  on 
the  MOMENTS  library.  This  simulator  will  be 
used  to  study  the  anomalous  multiplication  noise- 
suppression  recently  detected  in  III  -  V  compound 
separate  absorption  and  multiplication  (SAM) 
APDs  with  very  thin  (<  200  nm)  multiplication 
regions  [2,  3].  Conventional  APD  theory  relates 
the  multiplication  noise  to  the  ratio  of  hole  to 
electron  ionizations  (k)  in  the  multiplication  region 
of  the  device.  Smaller  values  of  k  are  associated 
with  less  noise.  The  lower  observed  noise  values  in 
devices  with  very  thin  multiplication  regions  may 
be  associated  with  phenomena  related  to  the 
nonlocality  of  the  impact  ionization  process.  One 
such  phenomenon  is  known  as  the  “dead  space 
effect”,  referring  to  the  fact  that  charge  carriers 
with  low  initial  energies  must  be  accelerated 
through  a  high  field  for  a  sufficient  distance  to 
acquire  enough  energy  to  ionize. 

Obtaining  a  detailed  understanding  of  the  dead 
space  effect  and  its  relationship  to  APD  device 
geometry  and  noise  figures  requires  simultaneous 
simulation  of  both  electrons  and  holes,  including 
the  ability  to  simulate  new  carriers  as  they  are 
generated  by  impact  ionization  events.  Moreover, 
accurate  characterization  of  transport  and  impact 
ionization  in  high-field  regions  entails  the  use  of 
realistic  numerically  calculated  bandstructure  data 
for  both  carrier  types.  The  capabilities  of  MO¬ 
MENTS  make  it  ideally  suited  for  the  study  of  this 
complex  problem.  Concurrent  simulation  of  multi¬ 
ple  carrier  species  is  trivial  in  MOMENTS,  thanks 
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to  the  SimObject  abstraction.  The  library’s  ability 
to  generate  new  carriers  on  the  fly  allows  a 
rigorous  simulation  of  the  impact  ionization 
process.  Finally,  the  modular  MOMENTS  design 
simplifies  incorporation  of  numerical  bandstruc- 
ture  data  based  on  nonlocal  pseudopotential 
calculations.  The  MOMENTS  package  was  used 
to  create  a  simple  1-D  GaAs  APD  simulator.  The 
simulator  employs  realistic  nonlocal  pseudopoten¬ 
tial  bandstructure  representations  for  both  carrier 
types.  In  addition  to  impact  ionization,  intra  and 
inter-valley  scatterings  by  both  optical  and  acous¬ 
tic  phonons  are  included.  The  simulator,  as¬ 
sembled  from  MOMENTS  components,  requires 
less  than  a  page  of  computer  code. 

Preliminary  results  from  this  new  simulator  are 
encouraging.  Figure  3  is  a  graph  of  gain  and  k  for 
several  multiplication  region  thicknesses  with  a 
fixed  field  of  700  kV/cm  in  the  multiplication 
region.  The  k  values  demonstrate  the  same  rapidly 
decreasing  trend  for  thinner  multiplication  regions 
seen  in  experimental  devices.  Figure  4  shows  the 
relative  ionization  rate  for  each  carrier  type  as  a 
function  of  position  along  a  100  nm  multiplication 
layer.  In  this  graph,  electrons  are  injected  at  the 
right  boundary  (x  =  0)  and  accelerate  to  the  left, 
while  generated  holes  are  accelerated  back  to  the 
right.  There  is  a  clear  “dead  space”  for  each  carrier 
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FIGURE  3  Simulated  gain  and  fc-ratio  for  several  GaAs 
APD  multiplication  layer  thicknesses  with  an  applied  field  of 
700  kV/cm. 
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FIGURE  4  Simulated  relative  ionization  rates  as  a  function 
of  position  across  a  100  nm  multiplication  layer  with  an  applied 
field  of  700  kV/cm. 

type,  indicated  by  the  shaded  regions.  We  believe 
this  is  the  first  quantification  of  dead  space  in 
APDs.  Note  that  the  dead  space  region  for  holes  is 
approximately  3-4  times  longer  than  that  for 
electrons,  indicating  that  as  the  multiplication 
layer  shrinks,  hole  ionizations  will  be  preferentially 
suppressed.  Thus  it  appears  that  dead  space  may 
indeed  explain  the  anomalously  low  noise  levels 
observed  in  thin  multiplication  layer  APDs, 
although  further  study  will  be  necessary  for 
confirmation. 


CONCLUSION 

In  conclusion,  we  have  developed  MOMENTS,  a 
modular  library  of  C  +  +  objects  that  serves  as  a 
framework  for  development  of  high-performance 
Monte  Carlo  simulation  applications.  The  library 
supports  parallel  execution  in  an  architecture- 
independent  manner,  and  provides  tremendous 
flexibility,  extensibility,  maintainability,  and  ease- 
of-use  -  properties  lacking  in  most  conventional 
Monte  Carlo  simulators.  To  demonstrate  the 
advanced  capabilities  of  MOMENTS,  we  have 
presented  preliminary  results  from  a  GaAs  APD 
simulator  based  on  the  MOMENTS  library.  To 
the  best  of  our  knowledge  this  Monte  Carlo  tool  is 
the  first  to  concurrently  simulate  the  transport  of 
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both  electrons  and  holes  using  realistic  numerical 
bandstructure  representations  for  both  carrier 
species  with  direct  simulation  of  the  carrier  genera¬ 
tion  process.  We  believe  that  MOMENTS  will  help 
make  the  Monte  Carlo  method  much  more 
accessible  to  the  TCAD  research  community  as  a 
whole.  This  should  be  a  tool  that  will  be  especially 
attractive  to  researchers  working  on  novel  device 
structures  or  in  novel  material  systems. 
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High-Field  Hole  Transport  in  Strained  Si 
and  SiGe  by  Monte  Carlo  Simulation: 
Full  Band  Versus  Analytic  Band  Models 
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Monte  Carlo  results  are  presented  for  the  velocity-field  characteristics  of  holes  in  (i) 
unstrained  Si,  (ii)  strained  Si  and  (iii)  strained  SiGe  using  a  full  band  model  as  well  as  an 
analytic  nonparabolic  and  anisotropic  band  structure  description.  The  full  band  Monte 
Carlo  simulations  show  a  strong  enhancement  of  the  drift  velocity  in  strained  Si  up  to 
intermediate  fields,  but  yield  the  same  saturation  velocity  as  in  unstrained  Si.  The  drift 
velocity  in  strained  SiGe  is  also  significantly  enhanced  for  low  fields  while  being 
substantially  reduced  in  the  high-field  regime.  The  results  of  the  analytic  band  models 
agree  well  with  the  full  band  results  up  to  medium  field  strengths  and  only  the  saturation 
velocity  is  significantly  underestimated. 


Keywords:  Monte  Carlo  simulation,  analytic  and  full  band  structures,  strained  Si  and  SiGe 


1.  INTRODUCTION 

The  progress  in  epitaxial  growth  techniques  of 
unstrained  and  strained  SiGe  layers  have  led  to 
intensified  efforts  to  explore  the  potential  perfor¬ 
mance  enhancements  in  SiGe  based  devices.  In 
particular,  the  practical  usefulness  of  p- MOSFETs 
with  a  channel  consisting  of  strained  Si  [1]  or 
strained  SiGe  [2]  has  been  recently  demonstrated. 
Since  field  effect  devices  operate  in  the  low-field 
and  in  the  high-field  regime,  reliable  modeling  of 
hole  transport  is  important  for  both  cases.  How¬ 


ever,  previous  publications  on  hole  transport  in 
strained  Si  and  SiGe  covered  only  the  low  field 
regime  [3]  or  were  restricted  to  strained  SiGe  and 
electric  field  strengths  below  20  kV/cm  [4]. 
Hence,  there  is  a  clear  need  for  investigations  of 
high-field  effects  like  velocity  saturation  where  the 
consideration  of  the  full  band  structure  is  often 
necessary  for  accurate  results.  On  the  other  hand, 
for  devices  with  realistic  Germanium  profiles  full 
band  Monte  Carlo  simulations  still  involve  an  un¬ 
manageable  computational  burden  (e.g.  prohibitive 
memory  requirements)  and  analytic  band  structure 
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approximations  have  to  be  used  instead.  The  aim  of 
this  paper  is  therefore  twofold:  On  one  hand,  we 
perform  for  the  first  time  full  band  Monte  Carlo 
simulations  for  strained  Si  and  SiGe  in  the  high-field 
regime.  On  the  other  hand,  we  present  a  simple 
analytic  hole  band  model  and  evaluate  its  range  of 
validity. 


2.  MODEL  DESCRIPTION 


mobility  obtained  with  the  parabolic  fits  to  the  full 
band  model.  This  condition  yields  for  a  =  0  in  the 
unstrained  case  (ray  =  m±  =  m) 


2/5  3/5 

m  =  ™cond™DOS 


(2) 


Finally,  using  this  mass  m  the  nonparabolicity 
factor  a  is  obtained  from  fitting  the  DOS  up  to 
1  eV.  A  similar  procedure  applies  in  the  strained 
case. 


The  full  band  model  for  strained  Si  or  SiGe  is 
obtained  by  nonlocal  empirical  pseudopotential 
calculations  including  spin-orbit  interaction  [9]. 
For  the  analytic  band  structure  we  neglect  the 
warping  of  the  three  valence  bands  v  =  1,2,3  and 
use  a  simple  parametrization  according  to 


E{\+avE)~~  ^  + 


kl  kl  k2 


W||)V  m  ||, v  «x,v 


(1) 


with  E  =  £—eo,v  because  of  the  feasibility  of  this 
formula  for  applications.  The  scattering  mechan¬ 
isms  included  are  optical  phonons  and  acoustic 
phonons  in  the  isotropic  and  elastic  equipartition 
approximation.  In  SiGe  both  Si-type  and  Ge-type 
phonons  are  considered  and  alloy  scattering  is 
taken  into  account  with  the  alloy  scattering 
potential  adjusted  to  drift  mobility  measurements 
in  unstrained  SiGe  [10].  Exactly  the  same  coupling 
constants  are  used  with  the  full  band  and  the 
analytic  band  model.  The  parameters  aV9  m\\iV  and 
m±yV  are  adapted  to  the  full  band  structure  for  the 
purpose  of  transport  applications.  The  starting 
point  is  therefore  the  expression  for  the  Ohmic 
drift  mobility  which  involves  for  the  scattering 
processes  used  only  the  Density  Of  States  (DOS) 
and  the  square  of  the  group  velocity  averaged  over 
an  energy  surface  V2  [11].  Then  a  parabolic 
expression  is  used  for  each  band  to  determine  the 
masses  raDO s  and  mcond  by  adjusting  the  DOS  and 
V2,  respectively,  up  to  about  40  meV  above  the 
band  edge  to  the  respective  full  band  results.  For  a 
good  transport  description  the  mobility  of  the 
analytic  band  model  in  Eq.  (1)  must  equal  the 


3.  VERIFICATION 

In  Figures  1,  2,  3  and  4  drift  mobilities  and  drift 
velocities  resulting  from  the  full  band  and  the 
analytic  band  model  are  compared  with  experi¬ 
mental  data  in  the  case  of  unstrained  Si  and  Ge 
because  an  accurate  reproduction  of  the  experi¬ 
ments  is  essential  in  view  of  the  increased 
importance  of  details  of  phonon  and  band  models 
in  the  strained  case.  Overall  good  agreement  is 
achieved.  Especially  the  full  band  model  in  Figure 
2  reproduces  accurately  the  anisotropy  of  the 
velocity-field  characteristics  as  well  as  the  satura¬ 
tion  drift  velocity  of  Ref.  [7].  Within  the  isotropic 
band  approximation  (unstrained  case)  also  the 


FIGURE  1  Temperature  dependence  of  Ohmic  drift  mobility 
for  holes  in  unstrained  Si:  comparison  of  full  band  model, 
analytic  band  model  and  experimental  results. 
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FIGURE  2  Velocity-field  characteristics  of  the  full  band 
model  for  unstrained  Si  at  77  and  300  K  in  comparison  with 
experimental  results. 


FIGURE  4  Velocity-field  characteristics  of  the  analytic  band 
model  for  unstrained  Si  at  77  and  300  K  in  comparison  with 
experimental  results. 


FIGURE  3  Velocity-field  characteristics  of  the  full  band 
model  for  unstrained  Ge  at  77  and  220  K  in  comparison  with 
experimental  results. 


analytic  band  model  yields  surprisingly  good 
results  and  only  significantly  underestimates  the 
drift  velocity  above  50  kV/cm. 


4.  RESULTS 

In  Figures  5  and  6  the  high-field  results  for 
strained  Si  and  SiGe  are  shown.  While  the  value 
of  the  saturation  drift  velocity  in  unstrained  Si  is 


ELECTRIC  FIELD  (kV/cm) 

FIGURE  5  Results  of  the  full  band  model  for  the  velocity- 
field  characteristics  at  300  K  in  strained  Si  grown  on  a  Si0.7  Ge03 
substrate,  in  unstrained  Si  and  in  strained  Si0.6  Ge0.4  grown  on 
a  Si  substrate. 


retained  in  strained  Si,  the  drift  velocity  at  lower 
fields  is  considerably  improved  due  to  the  en¬ 
hanced  population  of  the  light  hole  band.  In 
contrast,  the  saturation  velocity  is  reduced  in 
strained  SiGe,  but  there  is  still  a  substantial 
improvement  up  to  intermediate  fields.  But  please 
keep  in  mind  that  no  realistic  estimate  of  the 
corresponding  device  performance  can  be  based  on 
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ELECTRIC  FIELD  (kV/cm) 


FIGURE  6  Results  of  the  analytic  band  model  for  the 
velocity-field  characteristics  at  300  K  in  strained  Si  grown  on 
a  Si0.7  Geo.3  substrate,  in  unstrained  Si  and  in  strained  Si0.6  Ge0.4 
grown  on  a  Si  substrate. 

Figure  5  alone  because  advantages  like  the 
possibility  of  modulation  doping  have  to  be 
considered  for  this  purpose  as  well.  The  analytic 
band  model  again  underestimates  the  drift  velocity 
above  50  kV/cm  and  somewhat  overestimates 
anisotropy. 
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Advanced  MOSFET  for  ULSI  and  novel  silicon-based  devices  require  the  use  of  ultra- 
thin  tunneling  oxides  where  non-uniformity  is  often  present.  We  report  on  our 
theoretical  study  of  how  tunneling  properties  of  ultra-thin  oxides  are  affected  by 
roughness  at  the  silicon/oxide  interface.  The  effect  of  rough  interfacial  topography  is 
accounted  for  by  using  the  Planar  Supercell  Stack  Method  (PSSM)  which  can  accurately 
and  efficiently  compute  scattering  properties  of  3D  supercell  structures.  Our  results 
indicate  that  while  interface  roughness  effects  can  be  substantial  in  the  direct  tunneling 
regime,  they  are  less  important  in  the  Fowler-Nordheim  regime. 


Keywords:  Ultrathin,  oxide,  Si02,  tunneling,  interface  roughness 


1.  INTRODUCTION 

The  continued  scaling  of  Metal-Insulator-Semi- 
conductor  (MIS)  device  structures  has  brought 
much  attention  to  ultrathin  oxides.  Normal 
operation  of  a  MOSFET  with  1.5  nm  direct- 
tunneling  gate  oxide  has  been  reported  [1]. 
Tunneling  through  oxide  barriers,  as  a  mechanism 
for  leakage  currents,  is  of  particular  interest. 
Typical  theoretical  analysis  models  the  oxide  layer 
as  a  ID  barrier  with  an  effective  barrier  height  and 
an  effective  mass.  The  barrier  height  may  be 
obtained  experimentally  or  treated  as  a  fitting 
parameter,  while  the  effective  mass  is  normally 


used  as  a  parameter  for  fitting  measured  current- 
voltage  (I-V)  characteristics  [2].  Tunneling  coeffi¬ 
cients  can  be  calculated  using  the  well-known 
WKB  approximation.  Integration  of  tunneling 
coefficient  curves,  with  the  appropriate  Fermi 
factors  describing  carrier  statistics,  then  yields  an 
analytical  I-V  curve  formula  [3]  which  can  be  used 
conveniently  for  comparison  with  experimental 
data.  A  somewhat  similar  treatment  uses  multiple 
scattering  theory  instead  of  the  WKB  approxima¬ 
tion  to  compute  tunneling  coefficients  to  provide 
clarification  of  mechanisms  for  leakage  currents 
through  ultrathin  oxides  [4].  A  still  more  advanced 
treatment  solves  Poisson  and  Schrodinger  equa- 
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tions  self-consistently  for  accumulated  layers  in 
metal-oxide-semiconductor  devices  to  calculate 
tunneling  currents  [5], 

An  important  aspect  typically  not  treated  in 
these  models  is  oxide  non-uniformity.  Cundiff  and 
co-workers  [6]  showed  experimental  evidence  that 
roughness  at  the  Si/Si02  interface  increases  with 
decreasing  oxide  layer  thickness.  This  makes 
interface  roughness  particularly  important  in 
ultrathin  oxides.  Interface  roughness  can  have 
dramatic  effects  on  the  current-voltage  character¬ 
istics  of  MIS  structures.  It  has  been  shown  that 
constant  current  stressing  of  MIS  structures  in  the 
Fowler-Nordheim  tunneling  regime  can  induce 
non-uniformities  at  the  Si/Si02  interface  [7],  This 
quasi-breakdown  in  the  oxide  can  lead  to  dramatic 
increases  in  direct  tunnel  current.  In  this  paper  we 
treat  the  MIS  tunnel  structure  using  a  3D  model 
which  is  similar  to  the  standard  models,  but  allows 
for  3D  potential  variations  associated  with  Si/Si02 
interfacial  non-uniformity. 

2.  METHOD 

Standard  treatment  uses  a  one-dimensional  poten¬ 
tial  to  describe  the  oxide  barrier.  With  interfacial 
non-uniformity,  we  need  to  use  a  three-dimen¬ 
sional  description.  In  principle,  the  variations  in 
the  non-uniform  potential  extend  indefinitely  in 
the  directions  along  the  interface.  In  practice,  we 
do  not  perform  computation  for  an  infinite 
domain,  but  use  instead  a  quasi-3D  supercell 
geometry  to  approximate  the  physical  problem. 
We  treat  the  problem  of  tunneling  through  a  non- 
uniform  barrier  using  the  Planar  Supercell  Stack 
Method  (PSSM)  [8],  The  device  structure  treated 
by  PSSM  consists  of  an  active  layer  sandwiched 
between  two  semi-infinite  flat  band  electrode 
regions.  Let  the  z  axis  be  the  direction  of  current 
flow.  Then  the  active  region  is  composed  of  a  stack 
of  Nz  layers  perpendicular  to  the  z-direction,  with 
each  layer  containing  a  periodic  array  of  rectan¬ 
gular  planar  supercells  of  NxxNy  sites.  A  one- 
band  nearest-neighbor  tight-binding  Hamiltonian 


is  used  to  describe  the  potential  over  this  volume 
of  interest.  Our  model  is  formally  equivalent  to  the 
one-band  effective  mass  equation. 

h2  1 

_yv'^)vv’+F(x)v’=^  w 

discretized  over  a  Cartesian  grid,  and  subject  to 
periodic  boundary  conditions  (with  supercell 
periodicity)  in  the  x-  and  y-directions,  and  open 
boundary  conditions  in  the  z-direction.  PSSM 
solves  the  quantum  mechanical  scattering  problem 
exactly  for  the  3D  geometry  described  by  the 
planar  supercell  stack,  and  allows  us  to  compute 
transmission  coefficients  with  a  high  degree  of 
numerical  accuracy  and  efficiency.  Note  that  even 
though  the  supercell  geometry  imposes  an  artificial 
periodicity  to  make  computations  tractable,  the 
use  sufficiently  large  supercells  can  minimize 
supercell  artifacts  and  yield  excellent  descriptions 
of  the  physical  problem. 

3.  RESULTS  AND  DISCUSSION 

Our  model  of  the  MOS  tunnel  structure  consists  of 
a  metal  (or  poly-Si)  electrode,  followed  by  a  pure 
oxide  layer,  then  a  rough  interfacial  layer,  and 
finally  a  silicon  electrode.  We  assume  that  the 
rough  interfacial  layer  consists  of  a  50% -50% 
mixture  of  oxide  and  Si  in  random  configurations. 
The  Si  sites,  and  the  oxide  sites,  for  that  matter, 
may  aggregate  and  form  patches.  We  will  call  these 
silicon  patches  islands,  and  characterize  them  by 
their  lateral  extent  (island  size)  and  the  thickness 
of  the  interfacial  layer  (island  height).  For 
convenience,  we  also  assume  flat-band  conditions 
in  the  metal  and  silicon  electrodes,  and  let  all  the 
voltage  drop  occur  in  the  oxide  and  the  rough 
interface.  The  potential  barrier  height  and  the 
effective  mass  of  the  oxide  barrier  is  taken  to  be 
3.22  eV,  and  0.35  m0,  respectively.  We  use  a  cubic 
mesh  with  discretization  distance  of  0.13575  nm, 
and  64x64  planar  supercells  in  our  simulations. 

Figure  1  shows  transmission  coefficient  spectra 
for  three  MOS  tunnel  structures  with  rough  Si/ 
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Direct  Tunneling  thru  Si02 


Leff(Si02)=1.5nm  Rough  SiCX/Si  Interface 


FIGURE  1  Transmission  coefficient  spectra  for  MOS  tunnel 
structures  with  rough  Si/Si02  interfaces  characterized  by 
different  island  sizes,  at  zero  applied  bias.  For  comparison, 
results  for  ID  models  using  effective  barrier  thickness  and 
effective  barrier  height  are  also  shown. 


Si02  interfaces  characterized  by  island  sizes  of  A  = 
0.27, 1.2  and  2.2  nm.  The  pure  oxide  tunnel  barrier 
layer  is  1.36  nm  thick,  and  the  height  of  the  rough 
interfacial  layer  is  0.28  nm.  Note  that  tunneling 
probabilities  increase  considerably  with  island  size. 
Results  of  two  ID  models  are  also  plotted  for 
comparison.  The  first  treats  the  oxide  and  the 
interfacial  layer  as  having  a  combined  effective 
thickness  of  1.5  nm  (the  thickness  of  the  pure  oxide 
layer,  plus  half  the  thickness  of  the  interfacial 
layer);  the  second  treats  the  interfacial  layer  as 
having  a  potential  height  which  is  the  average  of 
the  oxide  and  the  silicon  potentials  (i.e.,  the  virtual 
crystal  approximation,  or  VCA  potential).  Note 
that  the  effective  thickness  model  does  not  offer  a 
good  description  of  the  rough  interfaces.  The  VCA 
result  agrees  well  only  with  that  obtained  for  the 
sample  with  the  small  island  size.  The  reason  for 
the  agreement  is  because  when  island  sizes  are 


smaller  than  the  deBroglie  wavelength  of  the 
incoming  electron,  the  random  potential  at  the 
interface  is  only  seen  by  the  electron  in  an 
averaged  (VCA)  sense.  In  structures  where  island 
sizes  become  larger  than  the  electron  deBroglie 
wavelength,  an  electron,  in  a  sense,  finds  its  way 
through  the  “softer”  spots  (portions  with  lower 
potential,  i.e.,  the  silicon  islands)  of  the  interfacial 
layer,  and  thereby  enhance  its  tunneling  prob¬ 
ability. 

This  is  demonstrated  in  Figure  2.  The  top  panel 
shows  the  transmission  coefficient  spectra  of  the 
A  =  0.27  nm  and  A  =  2.2  nm  structures  again  for 
reference,  while  the  bottom  panel  shows  the  curves 
of  island  transmission  fraction,  defined  as  the  ratio 
of  the  sum  of  probability  densities  over  all  silicon 
sites  in  the  interfacial  layer  divided  by  the  total 
probability  density  in  the  interfacial  layer.  Since 
the  interfacial  layer  consists  of  50%  silicon  sites 
and  50%  oxide  sites,  if  an  electron  shows  no 
preference  for  tunneling  through  the  silicon  islands 


Direct  Tunneling  thru  Si02 

Le[f{Si02)=1 .5nm  Rough  SiCySi  Interface 
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FIGURE  2  Transmission  coefficients  and  island  transmission 
fractions  for  two  MOS  structures  with  rough  interfaces 
characterized  by  different  island  sizes,  at  zero  applied  bias. 
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or  the  oxides,  the  island  transmission  fraction 
would  be  0.5;  a  fraction  greater  than  0.5  indicates 
a  preference  for  tunneling  through  the  silicon 
islands.  Our  results  show  that  in  the  A  =  0.27  nm 
(small  island)  structure,  there  are  essentially  no 
preferential  pathways  through  the  roughness  layer. 
However,  in  the  A  =  2.2  nm  (large  island)  struc¬ 
ture,  there  is  a  definite  preference  for  tunneling 
through  the  silicon  islands.  In  fact,  in  this  case  the 
island  transmission  fraction  increases  with  electron 
energy,  or  decreasing  deBroglie  wavelength. 

We  next  examine  the  effect  of  interface  roughness 
on  tunneling  properties  under  applied  bias.  We  use 
the  A  =  2.2  nm  structure  as  in  the  previous  figures. 
Figure  3  shows  a  set  of  transmission  coefficient 
spectra  for  this  structure  under  various  applied 
biases.  Transmission  spectra  for  the  corresponding 
VCA  structure  are  also  shown  for  comparison. 
Note  that  while  there  is  a  considerable  difference 
between  the  supercell  (with  interface  roughness) 
and  VCA  (without)  results  at  low  biases,  the 
difference  diminishes  at  high  biases.  This  indicates 
that  the  role  of  interface  roughness  becomes  less 
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FIGURE  3  Transmission  coefficient  spectra  for  a  MOS  tunnel 
structure  with  rough  Si/Si02  interfaces  at  several  applied  biases. 
Results  for  the  corresponding  VCA  structure  is  also  shown. 


Tunneling  thru  Thin  Oxide  Layer 


FIGURE  4  Island  transmission  fraction  for  the  MOS  tunnel 
structure  depicted  in  Figure  3. 


important  as  high  biases.  Figure  4  illustrates  this 
further  by  showing  the  island  transmission  fraction 
for  the  same  A  =  2.2  nm  structure  as  a  function  of 
applied  bias.  Note  that  while  there  is  a  preference 
for  tunneling  through  silicon  islands  at  low  biases, 
the  preference  drops  rapidly  as  the  applied  bias 
exceeds  3  volts.  The  reason  for  this  turns  out  to  be 
rather  simple.  At  low  biases,  the  device  structure 
operates  under  the  direct  tunneling  regime.  An 
electron  going  through  the  rough  interfacial  layer 
sees  the  oxide  portions  as  energy  barriers  and  the 
silicon  islands  as  open  pathways.  At  sufficiently 
high  biases,  the  device  operation  transitions  to  the 
Fowler-Nordheim  tunneling  regime,  where  the 
conduction  band  edge  at  the  trailing  interface  of 
the  barrier  (in  our  case,  the  rough  interfacial  layer) 
is  biased  below  the  incoming  electron  energies. 
Thus  the  oxide  portions  of  the  rough  interfacial 
layer  no  longer  act  as  electron  barriers,  putting 
them  on  more  equal  footing  with  the  silicon  islands. 
In  other  words,  in  the  low-bias  direct  tunneling 
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regime,  an  electron  traverses  through  the  oxide 
portions  of  the  interfacial  layer  with  evanescent 
characteristics,  and  the  silicon  portions  with 
propagating  characteristics.  In  the  high-bias  Fow- 
ler-Nordheim  tunneling  regime,  an  electron  tra¬ 
verses  through  both  types  of  the  interfacial  layer  as 
a  propagating  state,  hence  there  would  be  less 
reasons  for  preferential  pathways. 

In  summary,  we  have  studied  tunneling  through 
MOS  structures  with  rough  Si/Si02  interfaces 
using  a  3D  supercell  simulation.  We  find  that 
interfacial  non-uniformities  make  oxides  more 
permeable  in  the  direct  tunneling  regime,  espe¬ 
cially  in  structures  with  larger  islands.  In  the 
Fowler-Nordheim  regime,  however,  non-unifor¬ 
mity  effects  on  tunneling  characteristics  are  less 
pronounced. 
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A  fully  coupled  electro-thermal  hydrodynamic  model  is  described  which  is  suitable  for 
modelling  active  devices.  The  model  is  applied  to  the  non-isothermal  simulation  of 
pseudomorphic  high  electron  mobility  transistors  (pHEMTs).  A  large-scale  surface 
temperature  model  is  described  which  allows  thermal  modelling  of  semiconductor 
devices  and  monolithic  circuits.  An  example  of  the  application  of  thermal  modelling  to 
monolithic  circuit  characterization  is  given. 
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INTRODUCTION 

Until  relatively  recently  very  little  effort  has  been 
devoted  to  thermal  considerations  associated  with 
most  semiconductor  devices,  yet  in  many  cases  the 
temperature  of  the  device  and  self-heating  have  a 
very  strong  impact  on  their  performance.  The  high 
power  densities  associated  with  modern  micro- 
wave  transistors  makes  this  class  of  active  device 
particularly  sensitive  to  temperature  effects  and 
dependent  on  good  thermal  management.  Com¬ 
pound  semiconductors  such  as  GaAs  generally 
have  a  far  lower  thermal  conductivity  than  Si, 
which  can  in  turn  lead  to  high  operating  tempera¬ 
tures  and  poor  thermal  stability. 


Thermal  analysis  of  semiconductor  devices  can 
be  considered  in  two  broad  categories-solutions  in 
the  proximity  of  the  active  device  (usually  re¬ 
stricted  purely  to  a  local  hydrodynamic  treatment 
for  a  cross-section  of  a  few  square  microns)  [1,  2], 
and  solutions  which  encompass  the  whole  die, 
which  may  have  many  active  devices  or  elements 
that  are  interacting  thermally  [3,  4,  5].  Most  sim¬ 
ulations  that  follow  the  latter  large-scale  thermo¬ 
dynamic  approach  deal  with  steady-state  heat 
sources  and  do  not  encompass  the  active  device 
aspects.  In  practice,  it  is  desirable  to  link  the 
behaviour  on  the  microscopic  device  level  to  the 
macroscopic  scale,  since  this  would  allow  a  self- 
consistent  thermal  solution  to  be  obtained. 
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Contemporary  semiconductor  devices  often 
have  complex  geometries,  incorporating  air  bri¬ 
dges  and  heat  shunts  in  a  highly  three-dimensional 
structure.  There  is  generally  very  limited  control 
over  the  local  device  temperature  and  it  is  often 
very  difficult  to  measure.  Experimental  character¬ 
ization  usually  draws  on  temperature-sensitive 
liquid  crystal  paints  for  large-scale  devices  and 
infra-red  scanning  for  more  accurate  results.  The 
thermal  time  constants  associated  with  most 
practical  semiconductor  devices  lie  in  the  range 
10  ps  to  10  ms.  These  relatively  long  time-constants 
are  in  contrast  to  the  time-scales  over  which  most 
carrier  dynamics  occur  (typically  in  the  range  O.lps 
to  1  ns).  A  consequence  of  the  large  difference  in 
time  constants  is  that  it  is  not  usually  practicable 
to  follow  to  the  steady-state  the  evolution  of  full 
time  domain  electro-thermal  simulations  of  semi¬ 
conductor  devices,  where  the  time-dependent  heat 
flow  equation  is  solved  self-consistently  with  the 
carrier  transport  equations. 


THERMAL  AND  ELECTRO-THERMAL 
MODELING 

This  paper  addresses  the  modeling  of  thermal 
effects  in  semiconductor  structures,  examining 
the  basis  electro-thermal  simulation  and  then 
considers  the  impact  on  transport  modelling  and 
finally  large-scale  thermal  modelling  of  complete 
structures  (e.g.  a  full  die).  Self-heating  has  a 
significant  impact  on  parameters  such  as  mobi¬ 
lity,  gneration-recombination  and  trap  occu¬ 
pancy.  A  rigorous  electro-thermal  solution  is 
described  here  with  a  multi-cell  coupled  thermal 
and  transport  model.  This  is  achieved  by  solving 
a  set  of  hydrodynamic  equations  derived  from 
the  Boltzmann  transport  approximation  (utilizing 
four  moments),  coupled  to  an  accurate  solution 
of  the  heat  flow  equation. 

^  +  V(nv)=  0  (1) 
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clPl^=V(klVT)  +  Hs  (4) 


Hs  =  J  •  E  +  qEgG 


(5) 


Here  q  represents  the  charge  on  an  electron  and 
V  *  Q  represents  the  energy  flow,  n  is  the  electron 
density,  v  the  electron  velocity,  E  electric  field,  m* 
effective  mass,  xp  and  tw  momentum  and  energy 
relaxation  times  respectively,  and  w  average 
electron  energy,  cL  specific  heat,  pL  density,  kl 
lattice  thermal  conductivity,  T  lattice  temperature, 
and  G  generation-recombination  rate.  The  trans¬ 
port  parameters  are  assumed  to  be  temperature- 
dependent  and  are  obtained  from  bulk  Monte 
Carlo  simulations.  The  average  electron  energy  is 
expressed  in  terms  of  kinetic  energy  and  electron 
temperature  Te ,  as, 


w  =  ^m*v2  +  ^kTe  (6) 

where  Te  is  the  average  electron  temperature  and  k 
is  Boltzmann’s  constant.  The  energy  assumptions 
behind  this  model  are  discussed  in  [6]. 

The  numerical  solution  of  the  heat  flow  equa¬ 
tion  (equation  4)  requires  careful  consideration  to 
obtain  an  accurate  solution,  with  a  third-order 
boundary  condition.  Ghione  et  aL ,  have  suggested 
that  to  obtain  accurate  results  for  MESFETs,  the 
simulation  domain  for  analysis  should  be  extended 
horizontally  for  up  to  three  times  the  source-drain 
contact  spacing  and  to  a  depth  of  up  to  ten  times 
the  active  layer  thickness  of  the  device  [2]. 
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Thermal  modelling  of  semiconductor  devices 
requires  that  the  transport  and  material  para¬ 
meters  are  well  characterized  as  a  function  of 
temperature.  In  the  case  of  GaAs  the  following  set 
of  temperature-dependent  material  parameters  are 
required: 

Low  field  mobility  fio(T)  =  ^o(300) 

(7) 

Permittivity  er(T)  —  er(300)[l  +  Be{T -  300)] 

(8) 


Schottky  barrier  height  VBi(T)=  VBi{To)-kT(T-To) 

(9) 


Thermal  conductivity  kl  =  108.0(7T—  273.15)  026 

(10) 

where  £iO(300)  is  the  low  field  mobility  at  300  K, 
£r(300)  is  the  permittivity  at  300  K,  kT  is  the  tem¬ 
perature  coefficient  of  the  barrier  height  (in  the 
range  -0.26  to  —  1.6xl0“3  VK-1  depending  on 
temperature),  To  is  the  reference  temperature,  Be  is 
the  permittivity  temperature  coefficient  (in  the 
range  0.9  to  1.2  x  lO^KT1). 

Local  iso-thermal  and  non-isothermal  models 
have  been  developed  for  a  wide  variety  of 
semiconductor  devices  including  BJTs,  HBTs, 
MESFETs  and  HEMTs  (for  example  [7,  8,  9]). 
The  quasi-two-dimensional  pHEMT  model  de¬ 
scribed  in  [8,  9]  and  illustrated  in  Figure  1  is  well 
suited  to  electro-thermal  modelling  because  of  its 
very  high  speed  of  solution.  Examples  of  isother¬ 
mal  and  non-isothermal  DC  characteristics  ob¬ 
tained  using  this  approach  are  shown  in  Figure  2 
for  a  0.2  micron  gate  length  pHEMT.  It  should  be 
noted  that  this  pHEMT  model  also  includes  a  self- 
consistent  solution  of  the  Schrodinger  equation 
with  the  Poisson  and  transport  equations.  A 
comparison  of  measured  and  simulated  non-iso¬ 
thermal  DC  characteristics  for  a  similar  pHEMT 


FIGURE  1  Quasi-two-dimensional  model  for  a  pseudo- 
morphic  high  electron  mobility  transistor,  illustrating  the 
Gaussian  box  technique  as  described  in  [8]  and  [9]. 


are  shown  in  Figure  3.  Differences  between  the  two 
DC  characteristics  are  largely  attributable  to 
uncertainties  in  the  exact  density  of  the  pulse-doped 
layer  in  the  device  (there  is  no  fitting  of  the  data 
between  measured  and  simulated  results). 

A  three-dimensional  thermal  analysis  technique 
has  been  developed  which  allows  full-scale  analysis 
of  large  device  and  die  structures,  Figure  4.  The 
steady-state  heat  flow  equation  V  •  k{T)  V  T  =  0  is 
solved  for  a  three-dimensional  temperature  dis¬ 
tribution  T(x,y,z)  using  the  method  of  Liou  [4] 
and  Gao  et  al  [5].  Since  in  practice  most  semi¬ 
conductor  devices  are  located  very  close  to  the 
surface  of  most  die,  it  is  generally  only  required  to 
obtain  the  temperature  at  or  close  to  the  surface. 
In  these  circumstances  it  is  sufficient  to  calculate 
only  the  surface  temperature  T(x,y,  0).  Liou  and 
Gao’s  method  uses  a  double  Fourier  expansion 
method  to  speed  up  the  solution.  An  initial 
estimate  of  the  temperature  Tf ,  is  first  obtained 
by  assuming  that  the  temperature-dependent 
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lDS  (mA) 


0  1  2  3  4  5  6 

FIGURE  3  Measured  and  simulated  DC  non-isothermal 
characteristics  for  a  0.2  micron  gate  length  pHEMT. 


(b) 

VGs:  0.5  to  -2.5  V  steps  of  -0.5  V 

FIGURE  2  Simulated  DC  characteristics  for  (a)  isothermal 
and  (b)  non-isothermal  pHEMT  models,  implemented  using  the 
quasi-two-dimensional  model. 


thermal  conductivity  k(T)  is  equal  to  the  thermal 
conductivity  k,0  at  the  mounting  (heat  sink) 
temperature  r0, 
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KO  LxLy 
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where, 
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where  t  is  the  thickness  of  the  substrate,  m  and  n 
the  Fourier  coefficients.  The  active  region  of  the 
die  where  the  devices  are  located  is  subdivided  into 
regions  of  area  Ax  Ay  and  are  treated  at  heat 
sources  where  QtAxAy  is  the  heat  generated  at  the 
ith  unit  area  located  at  (xh  yt).  N  is  the  number  of 
unit  areas  on  the  surface.  The  upper  limit  of  the 
coefficients  m  and  n  must  be  as  large  as  possible 
and  in  practice  values  in  excess  of  70  ensure  a  local 
accuracy  of  better  than  1 K.  The  heatsink  is 
assumed  to  be  attached  to  the  bottom  of  the 
substrate  (at  T  ~  Tmount),  whilst  all  other  surfaces 
except  the  heat  sources  (the  active  region  of  the 
semiconductor  devices)  are  assumed  to  be  adia¬ 
batic.  The  temperature  T'  is  corrected  for  the 
temperature-dependent  thermal  conductivity  using 
Kirchhoff’s  transform,  yielding, 
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FIGURE  4  Three-dimensional  temperature  distribution  and  model  based  on  ‘line’  heat  sources  associated  with  the  local  heat 
generation  within  the  active  channel  of  the  device. 


T(x,y,  0)  = 


1 


(b-l)(T'-To) 


l  -(1/6-D 


n 


(12) 


which  assumes  that  the  thermal  conductivity  is 
proportional  to  (T/  T^)~b.  In  the  case  of  GaAs 
b =1.22. 

The  heat  sources  may  be  treated  locally  using 
the  coupled  electro-thermal  model,  and  the  solu¬ 
tion  is  iterated  between  the  local  transport  model 
and  the  full  die  model.  The  temperature  obtained 
from  the  3D  die  model  may  be  used  as  an  initial 
estimate  in  the  local  transport  model.  The  power 
density  obtained  from  the  solution  of  the  transport 
model  can  in  turn  then  be  used  to  set  £>/  in  the  3D 
die  model.  It  should  be  noted  that  it  is  necessary  to 
minimise  the  number  of  cells  since  a  separate 
transport  model  must  be  solved  for  each  cell.  This 
does  not  present  a  significant  computational  burden 
since  the  quasi-two-dimensional  models  can  be 
solved  in  less  than  0.01  cpu  seconds  on  a  typical 
workstation  and  in  practice  the  surface  temperature 
calculation  time  is  the  dominant  factor. 

An  example  of  the  application  of  thermal 
modelling  is  shown  in  Figure  5,  where  the  surface 
temperature  distribution  is  calculated  for  a 
pHEMTs  38  GHz  monolithic  power  amplifier 
design.  A  partial  view  of  the  monolithic  die  layout 
in  the  vicinity  of  the  pHEMTs  is  shown  in  Figure 
5(a)  and  the  region  chosen  for  thermal  analysis  is 
outlined  (an  area  of  800  x  400  microns).  The  die  is 


gate  drain 


(a) 


y  microns 


(b) 

FIGURE  5  Application  of  thermal  modelling  to  monolithic 
pHEMT  amplifier  design,  (a)  partial  view  of  layout  of  a 
balanced  38  GHz  monolithic  pHEMT  power  amplifier  (b) 
surface  temperature  distribution  for  the  area  defined  by  the 
dashed  line. 

400  microns  thick  and  the  mounting  temperature 
at  the  back  of  the  GaAs  substrate  is  40  °C.  The 
results  of  the  thermal  simulation  are  shown  in 
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Figure  5(b),  where  the  higher  temperatures  sur¬ 
rounding  the  pHEMT  finger  regions  are  evident. 
The  3D  thermal  simulator  utilized  over  20,000  unit 
cells  in  the  x,  y  plane  to  obtain  the  surface 
temperature  distribution;  This  technique  has  also 
been  applied  to  the  coupled  electro-thermal  mod¬ 
elling  of  heterojunction  bipolar  transistors  [10]. 
These  results  have  been  compared  with  infra-red 
measurements  and  were  found  to  show  good 
agreement  (better  than  6%  over  the  whole  die). 

CONCLUSIONS 

An  electro-thermal  model  has  been  presented  for 
semiconductor  devices  capable  of  describing  the 
interaction  between  the  lattice  temperature,  carrier 
transport  and  the  observed  electrical  behaviour.  A 
coupled  hydrodynamic-thermal  model  has  been 
used  to  represent  the  carrier  transport  in  the 
vicinity  of  the  active  region  of  the  device.  Large- 
scale  assessment  of  surface  temperature  has  been 
achieved  using  a  steady-state  electro-thermal 
model.  The  significance  of  thermal  effects  in 
small-scale  devices  has  been  illustrated  for  the 
case  of  the  short  gate-length  pHEMT  both  as  a 
discrete  device  and  integrated  into  a  practical 
monolithic  circuit. 
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Within  a  correlation-function  (CF)  formalism,  the  kinetic  coefficientsof  charge  carriers  in 
semiconductors  are  studied  under  different  conditions.  For  the  case  of  linear  response  in 
equilibrium,  thetransitions  from  the  non-degenerate  to  the  degenerate  regimes  as  wellas  from 
ballistic  to  diffusive  conditions  are  discussed  within  ananalytical  model.  Generalizing  the  method  to 
high-field  transport  innondegenerate  semiconductors,  the  CFs  are  determined  by  Monte  Carlo 
(MC)  calculations  for  bulk  silicon  from  which  the  appropriate  thermalconductivity  has  been 
obtained  and  included  into  the  hydrodynamic  code  HEIELDS.  For  an  n+nn+  submicron  structure 
the  temperatureand  velocity  profiles  of  the  carriers  have  been  calculated  with  HFIELDS. 


Keywords:  Kinetic  coefficients,  hydrodynamic  transport  equations,  correlation  functions,  Monte 
Carlo 


1.  INTRODUCTION 

Thermal  conductivity  of  charge  carriers  is  of 
fundamental  interest  in  describing  transport  phe¬ 
nomena  in  bulk  materials  as  well  as  electronic 
devices.  To  provide  a  microscopic  theory  for  this 
coefficient,  the  CF  approach  represents  a  very 
effective  method  [1].  As  a  consequence  of  the 
fluctuation-dissipation  theorem,  the  carrier  trans¬ 
port  coefficients  may  be  determined  by  the 
spectrum  of  the  fluctuations  in  the  system.  Results 
for  the  thermal  conductivity  available  in  literature 
are  mainly  based  on  relaxation-time  approxima¬ 
tions  [2,  3].  A  weighting  of  the  single  relaxation  time 


with  a  power  of  the  energy  yields  a  generalization 
ofthe  Wiedemann-Franz  law  (WFL)  given  by 

(1) 

where  c  is  the  so  called  power  law  exponent,  [i  is 
the  mobility  and  T  the  electron  temperature.  This 
WFL  together  with  the  energy  relaxation  time  is 
usually  introduced  within  hydrodynamic  ap¬ 
proaches  [4  ”8]. 

In  this  paper  we  present  the  appropriate  set  of 
CFs  for  the  cases  of  linear  response  around 
thermodynamic  equilibrium  in  the  ballistic  and 
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diffusive  regime  under  different  degeneracy  condi¬ 
tions  [9],  as  well  as  the  hot-carrier  regime  in  the 
classical-diffusive  condition  [10,  11]  with  its  appli¬ 
cations  to  the  simulation  of  an  n^nn  +  -structure. 


2.  LINEAR  RESPONSE  REGIME 


In  the  linear  regime,  generalized  fluxes  are  the 
response  of  the  material  to  generalized  externally 
applied  forces  mediated  by  the  kinetic  (or  Kelvin- 
Onsager)  coefficients  L ^  according  to 

(2) 

V 


with  /i,  v-  1,  2,  ji  and  j2  denoting  the  electrical 
current  and  the  energy-flux  densities,  respectively, 
while  X[  and  X2  are  the  generalized  driving  forces. 
The  kinetic  coefficients  in  general  depend 

on  angular  frequency  w  and  are  defined  by  means 
of  the  Fourier-Laplace  transforms  SM„(w)  of  the 
respective  CFs  C^v(t)  from  [12]: 


vs^M 

kBT 


(3) 


where  V  is  the  volume,  kB  is  the  Boltzmann 
constant,  T  the  absolute  temperature  and: 


POO 

=  /  dwC^{t)zxip(-iujt).  (4) 
Jo 


For  a  finite  one-dimensional  conductor  of  length  ( 
limited  by  ideal  (i.e.  completely  absorbing  and 
thermalizing)  contacts,  if  scattering  is  treated 
within  the  relaxation  time  approximation,  the 
thermal  conductivity  k  can  be  calculated  analyti¬ 
cally  showing  the  transitions  both  from  ballistic  to 
diffusive  (with  increasing  £)  and  from  degenerate 
to  non-degenerate  (with  increasing  T)  conditions. 
The  static  (i.e.  u  =  0)  limiting  cases  are  given  by  [9] 


.ndg,bl  _  nk2BT  _„dg  =  nt?k3J2T]/2 
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(7b) 

(7c) 

(7d) 


Here  r^dg  is  the  carrier  transit  time  in  the 
ballistic  ( bl )  case  under  non-degenerate  and 
degenerate  conditions,  respectively,  while  in  the 
diffusive  (df)  cases  rc  denotes  the  relaxation  time. 
Figure  1  shows  the  behavior  of  ft(0)  at  fixed  carrier 
density  as  function  of  £  and  T.  The  maximum 
value  is  reached  in  the  ballistic  degenerate  case. 


3.  FAR  FROM  EQUILIBRIUM 
(HOT-CARRIER)  CASE 


Here  C^Jj)  is  the  symmetrized  CF  at  thermal 
equilibrium 

C^{t)  =  I  <«JM(0)«J„(r)  +  fiJ„(0)«JM(0}  (5) 

where  the  6 Ju(t)  =  Jv- (J„)  denote  the  operators  of 
the  respective  observed  fluctuating  quantities,  and 
(. ..)  indicates  averaging  with  respect  to  the 
stationary  statistical  operator.  The  thermal  con¬ 
ductivity  k  is  given  as  a  function  of  the  LIJV 


The  CFs  of  microscopic  fluxes  are  calculated  with 
respect  to  stationary  values  at  the  given  bias  point 
and  the  thermodynamic  temperature  T  is  replaced 
by  the  noise-temperature  spectrum  T„{E,  w  =  0) 
[13]  associated  with  velocity  fluctuations  at  the 
given  field  E,  where  we  restrict  to  the  stationary 
case  (u>  =  0  ).  The  thermal  conductivity  parallel  to 
the  applied  electric  field,  k(E,  u>= 0)  ,  generalized 
to  hot-carrier  conditions  is  thus  given  by  Eq.  (6) 
with  the  following  replacement: 


k(w)  = 


L]\(lo)L22(u)  ~  L\2(oj)L2\(ljj) 

TLuH 


(6) 


T„(E,u)  = 


FRe[5n(£,^)] 
kBRe[<Td(E,  <u)]  ’ 


(8) 
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FIGURE  1  Thermal  conductivity  at  zero  frequency  (normal¬ 
ized  to  its  value  in  the  ballistic  degenerate  case)  at  a  fixed 
density  n  as  a  function  of  device  length  and  temperature. 
Vp(Ef°)  denote  the  Fermi  velocity  (Fermi  energy)  at  7=  OK 
for  the  given  density  n  and  rc  is  the  relaxation  time. 


where  vd  is  the  differential  conductivity.  Under 
thermal-equilibrium  conditions  (i.e. 

Tn{<jJ)  =  T,  and  standard  linear-response  formalism 
is  recovered  [14].  The  CFs  and  conductivities 
entering  the  definition  of  k  are  calculated  using 
MC  simulations  [15].  We  have  considered  the  case 
of  extrinsic  «-Si,  doping  concentration  1017cm-3, 
at  r=300  K.  Figure  2  from  a)  to  d)  reports  the 
four  longitudinal  CFs  C^it)  (fluctuations  of  flux 
variable  along  the  electric  field  direction)  at 
increasing  values  ofthe  electric  field  normalized 
to  their  respective  initial  values.  At  increasing 
fields  the  correlations  decay  faster.  C2 2(0  shows 
the  fastest  decay,  which  indicates  that  its  decay- 
rate  is  the  sum  of  momentum  and  energy  rates. 
Thus  we  see  that  even  at  thermal  equilibrium  an 
approach  based  on  a  single  time-scale  is  very  poor. 
We  observe  that  the  variances  Cn(0),  C22(0)  and 
Ci2(0)  =  C2i(0)  increase  systematically  with  in¬ 
creasing  electric  field, which  is  due  to  hot-carrier 
effects.  Figure  3  reports  thelow  frequency  spectral 
densities  S^V{E,  u;  =  0)  of  the  respective  CFs  as  a 
function  of  the  electric  field.  Their  different 


. . . . . 

o  10  1 


Time  (ps)  Time  (ps) 

FIGURE  2  Longitudinal  CFs  normalized  to  their  initial  value 
for  extrinsic  n-Si  with  doping  concentration  «=1017cm-3  at 
300  K  and  increasing  electric-field  strengths  of  1, 10,  and  100  kV/ 
cm. 


FIGURE  3  Spectral  densities  of  the  CFs  of  Figure  2;  at  u= 0 
as  a  function  of  electric-field  strength. 

behaviors  are  the  result  of  the  competitive  effect 
between  shortening  time-scales  and  increasing 
variances  of  the  corresponding  CFs  at  increasing 
fields.  The  carrier  thermal-conductivity  at  low 
frequency  calculated  from  Eq.  (6)  is  shown  in 
Figure  4.  It  is  compared  to  the  values  obtained 
from  a  parameterized  WFL  (1),  for  different  values 
of  the  power  law  exponent.  As  can  be  noticed,  the 


62 


A.  GREINER  et  al 


FIGURE  4  Thermal  conductivity  per  unit  concentration  in  n- 
Si  at  300  K  as  a  function  of  electric-field  strength.  The  different 
curves  are  obtained  by  setting  c  =  0.0,  c~— 1.0  and  c~—  2.1  in 
the  WFL  and  by  using  the  CF  approach  together  with  MC. 

strong  decrease  at  high  fields  may  not  be 
reproduced  by  a  parameterized  WFL. 


4.  DEVICE  SIMULATIONS 

To  study  the  role  of  thermal  conductivity  in  device 
csimulation,  the  transport  in  a  Si  submicron  n+  —n 
n+  structure  has  been  analyzed  by  means  of  the 
hydrodynamic  device  simulator  HFIELDS  [16]. 
The  simulated  structure  is  the  same  n  + —n—n  + 
diode  analyzed  in  [17]:  the  high  doped  n  +  -regions 
have  length  0.1pm  and  doping  concentration 
5xl017cm-3,  the  lower  doped  ^-region  has  length 
0.4  pm  and  doping  concentration  2xl015cm“3. 

The  electron  mobility  p(e)  and  energy-relaxa¬ 
tion  time  r£{e)  as  a  function  of  the  mean  electron 
energy  e,  required  in  the  hydrodynamic  simulator, 
are  calculated  from  the  velocity  and  energy  versus 
field  curves  obtained  from  MC  simulations. 

For  the  thermal  conductivity  both  the  models 
obtained  from  the  results  of  MC  simulations  and 
from  the  parameterized  WFL  are  adopted.  In 
Figure  5  the  velocity  profile  along  the  device 
obtained  with  different  models  of  the  thermal 
conductivity  are  shown.  When  the  WFL  (1)  is 
applied,  the  spike  in  the  electron  velocity  curve  can 


be  remarkably  reduced  by  setting  the  value  of  the 
power  law  exponent  to  c  =  —  2.1;  at  the  same  time 
the  velocity  profiles  is  smoothed  with  respect  to  the 
MC  solution  presented  in  [17]  for  the  same 
structure.  This  can  be  ascribed  to  the  slow  decrease 
of  the  WFL  thermal  conductivity  with  increa¬ 
sing  electric  field.  The  CF  approach  yields  to  a 
stronger  dependence  of  the  thermal  conductivity 
on  the  electric  field,  which  allows  to  obtain  a 
better  comparison  with  MC  data.  The  carrier 
temperature  profile  is  shown  in  Figure  6:  alsoin 
this  case  the  WFL  approach  tends  to  smooth  the 


FIGURE  5  Velocity  profile  of  the  n  +  —n—n  +  device  calculated 
with  four  different  thermal  conductivities  as  shown  in  Figure  4. 


FIGURE  6  Temperature  profile  of  the  n+—n—n+ device  for 
the  cases  as  shown  in  Figure  5. 
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profile  with  respectto  the  case  when  the  results  of 
the  CF  method  are  applied. 


5.  CONCLUSION 


We  have  shown  that  the  CF  formalism  provides  an 
effective  method  forcalculating  electronic  trans¬ 
port  parameters  in  different  transportregimes. 
These  parameters,  calculated  in  the  case  of  non- 
equilibriumand  incorporated  into  the  hydrody¬ 
namic  simulatorhfields,  allowfor  a  description  of 
the  electronic  behavior  of  submicron  structuresin 
agreement  with  the  results  of  more  sofisticated 
microscopic  approaches. 
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Molecular  wires  have  several  promising  features,  that  would  appear  to  make  them  ideal 
for  advanced  interconnects  in  nanoscale  electronic  devices.  We  discuss  several  aspects  of 
the  linear  and  nonlinear  conductance  of  molecular  wire  interconnects.  Topics  include 
energy  dependence  of  molecular  conductance,  resonant  tunneling  behavior,  control  of 
conductance  by  molecular  structure  and  geometry,  length  dependence  including  the 
tunneling  regime  energetics.  Design  rules  using  molecular  interconnects  will  differ 
substantially  from  those  with  more  standard,  lithographically  structured  silicon 
interconnects.  In  particular,  the  dissipation  mechanisms  will  differ,  both  tunneling  and 
ballistic  regimes  should  be  available,  coulomb  blockade  and  staircase  behavior  will  be 
observed  (but  under  differing  conditions)  and  fabrication  of  gate  electrodes  is  a  challenge. 


Keywords:  Molecular  wire,  interconnect,  resonant  tunneling,  molecular  electronics 


1.  INTRODUCTION 

Early  work  in  molecular  electronics  was  largely 
speculative,  because  the  appropriate  fabrication 
and  characterization  methods  were  not  available 
on  the  nanometer  distance  scale  characteristic  of 
molecular  structures  [1-7].  Feynman’s  remarks 
concerning  fabrication  from  the  bottom  up  (by 
molecular  or  atomic  assembly)  rather  than  from  the 
top  down  (by  masking/lithograph  techniques)  is  the 
intellectual  antecedent  for  much  of  this  work  [8]. 


The  advent  of  molecular  surface  functionaliza¬ 
tion  and  self-assembly  methods  [9],  coupled  with 
synthetic  organic  and  inorganic  chemical  methods 
for  producing  molecular  wire  structures  reprodu- 
cibly  and  in  good  yield  [10,  11],  has  improved 
candidate  structures  for  molecular  wires.  Deve¬ 
lopment  of  both  scanning  probes  (especially  STM) 
and  nanofabricated  measurement  structures  such 
as  mechanically  controllable  break  junctions  [12], 
nanopore  enclosures  [13]  and  assembled  nanodot 
arrays  [14]  has  permitted  structural  investigation 


*  Corresponding  author.  Tel.:847/491-5652,  fax:  847/491-7713,  e-mail:  ratner@chem.nwu.edu. 


65 


66 


M.  KEMP  et  al 


of  individual  molecules  or  assemblies  at  electrode 
interfaces,  and  deduction  of  conductances  for 
single  molecules  [1,  12,  15,  16]. 

Within  the  past  18  months,  individual  molecules 
have  been  shown  to  exhibit  resonant  tunneling 
behavior  [17],  coulomb  blockade  and  staircase 
behavior  at  room  temperature  [12],  gate  control  of 
electronic  currents  [17,  18],  long  distance  (micron) 
current  carrying  capability  [17,  18],  and  rectifica¬ 
tion  properties  [19].  Coupled  with  related  work  on 
inelastic  electron  tunneling  spectroscopy  [20],  on 
Langmuir-Blodgett  film  conductivity  [21],  and  on 
the  much  larger  area  of  conductive  organic 
materials  [22],  this  recent  work  constitutes  an 
impressive  knowledge  base  suggesting  that  the  use 
of  molecular  wires  as  interconnects  is  practical  and 
promising. 

Important  advantages  of  molecular  wires  in¬ 
clude: 

1 .  Size  (characteristic  radii  of  molecular  wires  are 
subnanometer,  while  length  can  vary  from  sub¬ 
nanometer  to  micron). 

2.  Preparation:  synthetic  chemistry  allows  a  myri¬ 
ad  of  candidate  structures,  with  differing 
rigidity,  optical  properties,  band  gaps,  electro¬ 
nic  couplings,  Vibronic  couplings,  chemical 
reactivity,  optical  behavior,  etc.  [11,  22]. 

3.  Low  Cost:  most  polymer  prices  are  quoted  per 
pound  or  per  ton. 

4.  Tailorability  and  functionalization  possibility: 
molecular  wire  structures  can  be  varied  using 
synthetic  chemistry;  attachment  to  electrodes 
can  vary  in  bond  strength  and  electronic 
coupling  [11,  22]. 

While  much  of  the  interest  in  molecular  electro¬ 
nics  has  been  in  device  structures  [1  -7]  (molecular 
rectifiers  [19,  23,  24],  molecular  logic  gates  and 
switches  [25],  molecular  two-state  devices  or  n - 
state  devices,  molecular  recognition  sensors  [26], 
molecular  optoelectronic  gates),  it  is  probably  the 
interconnect  structure  of  molecules  that  is  most 
promising  for  hybrid  molecular/solid  state  electro¬ 
nics. 


2.  MOLECULAR  WIRE  CONDUCTANCE 

Electron  transfer  through  molecular  wires  is 
closely  related  to  intramolecular  electron  transfer, 
in  which  a  molecular  bridge  is  used  to  facilitate 
transfer  of  a  single  electron  between  terminal 
donor  and  acceptor  molecular  species  [27,  28]. 
The  difference  between  these  two  situations  is  that 
the  continuum,  that  causes  linear  transport  (rate 
constant  in  the  intramolecular  electron  transfer 
situation  or  conductance  in  the  molecular  wire) 
will  differ:  for  intramolecular  electron  transfer,  the 
continuum  is  vibronic,  while  for  molecular  wires, 
the  continuum  is  the  electronic  energy  level 
continuum  of  the  electrodes  [27]. 

A  characteristic  molecular  wire  structure  is 
shown  in  Figure  1A,  in  which  a  thiol  terminated 
aromatic  molecular  bridge  is  linked  to  gold 


FIGURE  1A  Schematic  of  a  molecular  wire  group  bridge 
between  two  continuum  metal  electrodes. 


FIGURE  IB  Energy  level  schematic  diagram  of  a  molecular 
wire  structure  like  that  in  Figure  la.  The  hemispherical 
structures  represent  the  energy  bands  of  the  metal;  EF  is  the 
Fermi  energy.  The  discrete  energy  levels  are  those  of  the  site 
orbitals  in  the  molecule,  while  V\  and  VN  are  the  matrix 
elements  coupling  the  terminal  sites  in  the  molecular  wire  to  the 
metallic  interface. 
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electrodes;  in  the  sketch,  the  electrode  link  is 
indicated  as  a  single  interaction,  but  its  actual 
nature  will  vary  depending  on  the  chemical 
binding  or  (in  less  favorable  cases)  physisorption 
interaction  between  bridge  and  electrode.  The 
model  that  we  will  use  for  computing  the  conduct¬ 
ance  is  sketched  in  Figure  IB.  This  is  an  energy- 
level  diagram  showing  the  continuum  electrode 
levels,  the  fermi  energy,  the  discrete  energy  levels 
corresponding  to  the  atomic  sites  in  the  molecular 
wire,  and  the  characteristic  tunneling  strengths 
(electronic  mixing  matrix  elements)  V\  and  VN  at 
the  first  and  Nth  sites  of  the  wire. 

Several  formalisms  for  calculating  the  conduc¬ 
tance  are  available.  Lang  has  developed  [29]  the 
density  functional  approach  using  a  jellium  model, 
and  has  demonstrated  its  ability  to  deal  with 
atomic  wire  structures.  Joachim  and  collaborators 
[30]  have  used  an  approach  in  which  the  electrode 
is  represented  as  a  tight  binding,  finite  metal  block. 
Datta  and  collaborators  [31],  and  our  group,  have 
utilized  a  formalism  in  which  the  interaction 
between  the  discrete  states  of  the  molecular  wire 
and  the  continuum  states  of  the  electrode  is  repre¬ 
sented  by  a  Newns/Anderson  formalism  [32]. 
Details  are  given  elsewhere  [33],  but  the  important 
point  is  that,  due  to  the  bridge,  the  first  and  last 
sites  in  the  molecule  acquire  a  self-energy,  denoted 
£.  That  is, 

Hn^Hn+ZiiE) 

Hnn  — ►  Hm  +  ) 

Molecular  site  energies  are  unchanged,  except  for 
the  first  and  last  sites,  each  of  which  acquires  a  real 
part  (energy  shift)  and  an  imaginary  part  (life¬ 
time).  Equation  2  indicates  the  two  parts  of  the 
self-energy. 

Y,(£)  =  A*(U  -  M£)  K=l,N.  (2) 

k 

Using  the  results  of  this  discrete/continuum 
interaction  representation,  and  the  Bardeen  tun¬ 
neling  formula  [34],  for  conductance,  one  can 
derive  formulas  for  the  conductance  g  or  current  I 


both  in  the  linear  regime  (current  assumed  propor¬ 
tional  to  voltage)  and  in  the  more  general 
situation.  These  results  are  given  by,  respectively, 

g  =  -^  A\(Ef)An(Ef)\Gin(Ef)\2  (3) 

I  =  %  [EF  dEAx{E)AN{E  +  eW)\G1N{E,  W)\2 

™  J EF—eW 

(4) 

Here  Ef  is  the  electrode  Fermi  level  (injection 
level),  W  is  the  applied  voltage,  47  is  the 
bandwidth,  AN  and  A!  are  the  imaginary  parts 
of  the  self-energy  from  Equation  2,  given  by 


ME)  =  V\ /7j  ^  f 2/472 


\E/27\  <  1 
\E/2l\  >  1 


(5) 


Finally,  G\N  is  the  (1  ,N)  matrix  element  of  the 
electronic  Green’s  function,  described  by 

G~\E)=E+ir1  +  Y,{E)-H-)  (6) 

Here  H  is  the  molecular  electronic  Hamiltonian, 
and  77  is  a  positive  infinitesimal. 

The  physical  interpretation  of  Equation  3  is 
straightforward:  the  wire  conductance  is  given  by 
the  atomic  unit  of  conductance  times  the  electrode 
interaction  at  the  first  and  Nth  sites  of  the  bridge, 
times  the  Green’s  function  term  that  describes  the 
electronic  mixing  between  the  orbitals  at  the  two 
terminal  sites  of  the  molecular  wire.  Equation  4 
generalizes  this  to  the  dependence  upon  applied 
voltage,  and  is  applicable  to  the  situation  in  which 
the  source/drain  current/voltage  curve  is  not 
linear.  This  equation,  thus,  permits  understanding 
of  staircase  behavior  [33].  That  requires,  though, 
proper  solution  to  the  Poisson  equation  that 
describes  the  potential  dependence  of  the  molecu¬ 
lar  site  energetics.  This  problem  has  not  really 
been  addressed  yet  in  the  published  literature. 

Although  descriptions  of  the  staircase  behavior 
using  Equation  4  have  been  given  [33]  we  limit 
ourselves  here  to  the  linear  conductance  behavior, 
as  the  simplest  characterization  of  the  interconnect 
ability  of  these  molecular  wires. 
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3.  STRUCTURAL  AND  ENERGETIC 

DEPENDENCE  OF  WIRE  CONDUCTANCE 

Molecular  wires  will  exhibit  substantial  sensitivity 
to  the  details  of  the  molecular  electronic  structure. 
Understanding  the  dependence  of  conductance  on 
laboratory  parameters  such  as  construction  of  the 
wire  itself,  temperature,  chemical  linkage  between 
wire  and  electrode  and  injection  energy  requires  a 
combination  of  chemical  concepts  for  the  wire  and 
continuum  concepts  for  the  interface. 

3.1.  Conductance  Variation  with  Bridge  Chemical 
Structure  and  Injection  Energy 

Three  continuous  variables  characterize  the  con¬ 
ductance  of  the  molecular  wire  interconnect:  the 
injection  or  Fermi  energy,  the  temperature,  and 
the  potential  difference  between  the  two  electrodes; 
for  a  linear  conductance  as  given  by  Equation  3, 
the  latter  variable  can  be  ignored.  Temperature 


dependence  is  a  complicated  problem;  one  expects 
statistical  factors  in  the  electrodes  and  various 
shape  modification  of  the  barrier  by  the  applied 
field  (essentially  the  Fowler-Nordheim  treatment) 
[35]  and  temperature  dependent  behaviors  in  the 
wire  itself,  including  possible  electron  trapping 
behavior,  dephasing,  and  structural  changes.  For 
simplicity,  we  neglect  temperature  effects  (limit  of 
low  temperatures). 

The  only  important  continuous  variable  is  then 
the  injection  energy.  Figure  2  shows  the  predicted 
energy  dependence  for  a  fully  conjugated  (assumed 
planar)  diphenyl  bisthiol  structure  with  gold 
electrodes.  The  notations  a  and  i r  refer  to  the 
angular  momentum  symmetries.  The  effective 
injection  has  both  a  and  7r  in  planar  geometries 
these  are  good  symmetry  quantum  numbers,  so 
that  the  channels  can  be  separated.  Several 
interesting  aspects  of  Figure  2  can  be  noted:  there 
is  resonant  tunneling  behavior  characterized  by 
negative  differential  conductance  -  that  is,  as  the 


I _ , _ a '  _ i _ . — ! J — L. '  1  1  '/  ■  i  i  ■  ■  ! i 

-15  -13  -11  -9  -7  -5  -3  -1 

Fermi  energy  (eV) 


FIGURE  2  Conductance  of  the  molecular  structure  in  Figure  1A,  calculated  using  the  linear  approximation  formula  of  Equation  3. 
Note  the  resonances  ( —  lines  represent  molecular  orbital  eigenvalues).  Resonant  tunneling  occurs  whenever  the  7r  level  is  degenerate 
with  the  Fermi  energy.  Conductance  maxima  are  very  close  to  the  atomic  unit  of  conductance,  corresponding  to  efficient  resonant 
tunneling. 
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injection  energy  passes  through  one  of  the 
resonances  corresponding  to  a  molecular  orbital 
of  the  bridging  structure,  the  conductance  exhibits 
a  local  maximum,  subsequently  decreasing  with 
increased  injection  energy.  The  self-energy  terms 
of  Equation  2  become  important  here:  divergence 
of  the  conductance  is  avoided  because  of  the 
continuum  structures  of  the  electrodes,  that 
provide  imaginary  (damping)  behavior  at  the 
terminal  sites  of  the  molecular  bridge.  The  finite 
conductance  will  be  reduced  compared  to  the 
atomic  conductance  g0  =  (12.9K.fi)”1,  by  effects  of 
molecular  mixing,  of  the  interfacial  covalent  link, 
and  of  vibrational  dissipation  that  are  not  included 
in  this  discussion  [40-42].  In  general,  the  7r-type 
contributions  are  larger  than  a:  this  is  in  accor¬ 
dance  with  a  huge  amount  [36]  of  organic  chemical 
wisdom,  that  delocalization  through  the  re  system 
is  much  more  effective  than  a. 

For  actual  conductance  computation,  it  is 
important  to  know  the  injection  energy.  This 


problem  is  not  yet  solved,  although  it  seems 
reasonable  to  assume  [31,  33]  that  the  injection 
energy  will  be  fixed  at  mid-gap.  This  assumption 
has  been  made  in  several  computations,  and 
agreement  with  experiment  is  generally  very  good. 
Actual  proof  requires  self-consistent  solution  to 
the  Poisson  equation  that  describes  actual  electron 
density  /potential  structures  in  the  inhomogeneous 
interconnect  situation. 

Since  7r-delocalization  is  generally  stronger  than 
<7,  one  might  expect  that  twisting  of  the  molecular 
structure  to  impede  the  re  conjugation  would 
decrease  conductance.  This  is  shown  in  Figure  3 
for  triple  bonded  and  double  bonded  systems 
respectively,  and  the  differences  are  striking.  The 
angular  dependence  is  much  smaller  in  the  triple 
bond  case,  essentially  because  the  triple  bond  is 
cylindrically  symmetric.  Conversely,  the  double 
bond  exhibits  mirror  symmetry  in  the  plane,  so 
that  twisting  by  ir/2  around  it  results  in  localiza¬ 
tion  and  strongly  reduced  conductance.  The  actual 


FIGURE  3  Computed  conductance,  as  a  function  of  twist  angle,  for  molecules  of  the  same  structure  as  in  Figure  la,  but  with 
double  and  triple  bonds  inserted  between  the  rings.  When  the  7r  systems  become  orthogonal  (90  degree  twist  angle),  the  conductance 
is  substantially  reduced.  The  functional  dependences  on  the  double  or  triple  bonds  differ,  because  of  the  different  symmetries 
involved. 
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ular  wires  in  the  absence  of  dissipation,  dephasing, 
trapping,  and  defect  formation  effects.  Accord¬ 
ingly,  Figure  4A  shows  the  predicted  conductance 
with  thiol  (sulfur)  first  and  last  chain  sites  and  the 
indicated  bridging  units.  Several  characteristic 
behaviors  expected  on  the  basis  of  organic 
chemistry  are  apparent  here:  the  n  systems  are 
better  conductors  than  the  a  systems,  bond-length 
alternation  effects  make  the  alkynes  less  good  than 
the  alkenes,  changing  local  bond  geometries  and 
so-called  cis  and  trans  linkages  change  the 
conductance  of  the  polyalkyne  structures.  j 

Decay  is  exponential  with  length  in  all  cases.  J 

The  reason  is  that  the  electron  is  never  actually 
localized  on  the  molecular  bridge,  but  rather  j 

undergoes  bridge-assisted  coherent  tunneling  be¬ 
tween  electrodes.  This  arises  because  in  our  formal 
approach,  dissipation  mechanisms,  that  would 
allow  the  electron  actually  to  trap  on  the  bridge,  I 

are  not  included.  ! 

loglO  (conductance)  vs  molecular  length 


(A)  molecular  length  (angstrom) 

FIGURE  4A  Logarithm  of  the  conductance  versus  molecular  length,  for  typical  double  bonded  and  triple  bonded  extended 
systems  (such  as  those  in  Figure  1A,  but  with  double  bond  or  triple  bond  chains  introduced  between  the  rings).  Notice  the 
characteristic  exponential  decay  with  distance,  and  the  different  decays  of  the  triple  and  double  bonded  systems. 


conductance  of  the  double  bonded  structures  are 
larger  than  that  of  the  triple  bonded  bridges, 
because  of  bond  alternation  effects  that  result  in 
stronger  electron  localization  for  the  triple  bonded 
(alkyne)  than  the  double  bonded  (alkene)  struc¬ 
tures.  Alkyne  linkages  are  of  interest  in  connection 
with  molecular  photonic  wires  [37]  and  nonlinear 
optical  devices  [38]. 


3.2.  Length  Dependence  -  Tunneling 
and  Ohmic  Behaviors 

Molecular  wires  generally  are  not  ohmic,  except 
under  special  conditions.  For  an  ohmic  linear  wire, 
one  expects  the  conductance  to  scale  as  the  inverse 
wire  length.  Exponentially-decaying  tunneling  has 
been  well  documented  in  intramolecular  electron 
transfer  reactions  [28,  29],  and  is  expected  on  a 
series  of  theoretical  grounds  to  characterize  molec- 
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Inclusion  of  dissipation  has  been  discussed  for 
intramolecular  electron  transfer  [40-42]  (the  rate 
constant  for  intramolecular  electron  transfer  is 
expected  to  scale  generally  as  the  conductance  for 
molecular  wires,  although  inclusion  of  dissipation 
in  real  molecular  wire  calculations  has  not  yet  been 
presented).  Figure  4B  presents  the  length  depen¬ 
dence  of  the  intramolecular  transfer  rate  if  a  tight 
binding  model  is  assumed,  and  the  matrix  element 
Vb  characterizes  the  mixing  between  the  sites  of  the 
molecular  bridge  of  Figure  1.  The  gap,  u>,  is  the 
energy  required  to  place  an  electron  in  an  affinity 
level  (that  is,  to  reduce  chemically)  of  the  mole¬ 
cular  bridge.  For  fixed  mixing  matrix  element 
V\  =  VN  =  Vb,  there  is  substantial  energy  varia¬ 
tion  of  the  rate  constant.  Indeed,  when  the  gap 
becomes  comparable  to  the  mixing  matrix  element, 


there  is  essentially  no  decay  of  the  rate  constant 
with  the  length  of  the  bridge  (since  the  continuum 
is  included,  resonance  tunneling  does  not  result  in 
a  divergence  of  the  rate  constant).  Physically, 
dissipation  mechanisms  will  always  occur,  so  that 
no  true  length  independence  can  occur.  In  weak 
dissipation,  however,  the  length  decay  may  be  very 
small,  as  has  recently  been  observed  [17,  18]  in  very 
low  temperature  measurements  on  carbon  nano¬ 
tube  structures. 

The  energy  gap  u  of  Figure  4B  is  related  to  the 
injection  energy  of  Figure  2;  vanishing  u  corres¬ 
ponds  to  injection  exactly  at  one  of  the  resonances 
in  Figure  2. 

Functional  dependence  of  the  rate  constant 
(conductance)  can  be  formulated  simply  if  Vbjuj 
is  much  less  than  one,  and  the  tight  binding  model 


V  =  100  cm'1,  aG°  =  0  cm'1,  X  =  1500  cm'1 
T  =  300  K,  r  =  0  cm'1 


(B)  N 

FIGURE  4B  The  calculated  intramolecular  electron  transfer  rate  (units  of  s’1).  The  parameters  are  those  of  a  simple  tight  binding 
model,  as  in  Equation  7;  V  is  the  matrix  element  between  neighboring  sites,  A  the  reorganization  energy,  and  w  the  gap  between  donor 
or  acceptor  levels  and  bridge  levels.  Note  that  the  characteristic  decay  parameter  (the  /3  of  Equation  9)  indeed  strongly  depends  on 
the  gap  size. 
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•(to  chemists,  the  Huckel  model)  is  assumed  for  the 
molecular  bridge.  Under  these  conditions,  the 
Green’s  function  will  scale  like  [42]  (/j=l) 

Gin  ~  1 1  -{Vb/w)N  1  (7) 

UJ 

for  an  TV-site  bridge,  and  then  the  conductance 
behavior  (or  rate  constant)  is  expected  to  follow 
Equation  8. 

g(N)=g(l)exp{-/3(N-\)R0}  (8) 

This  exponential  decay  is  then  characterized  by  the 
constant  /?,  which  in  the  simplest  (super  exchange) 
model  is  given  as 

P  =  -^\n(VB/uj)  (9) 

Here  R0  is  the  repeat  unit  length  on  the  molecular 
bridge.  Exponential  behavior  has  been  very  well 
studied  for  intramolecular  electron  transfer  rates 
[28,  43]. 

4.  REMARKS 

Molecules  are  of  substantial  interest  as  possible 
device  components  in  advanced  electronics  and 
optoelectronics;  their  most  obvious  and  promising 
short-term  application  is  in  the  area  of  intercon¬ 
nects.  Electron  transfer  can,  then,  be  used  directly 
to  pass  current  among  active  device  components 
(or,  more  speculatively,  to  provide  field  switching 
possibilities)  [46]. 

We  have  analyzed  several  of  the  characteristic 
behaviors  to  be  expected  for  single  molecule 
interconnects.  Even  in  relatively  unfavorable  cases, 
single  molecules  can  carry  a  current  of  order  0. 1  nA 
for  applied  fields  of  one  volt;  [14,  15],  this  is  large 
enough  for  applications,  and  can  be  increased  in 
more  favorable  cases  such  as  single-walled  nano¬ 
tubes  [18]  or  strongly  conjugated  oligomers. 

There  are,  essentially,  two  regimes  in  the 
conductance  of  characteristic  molecular  wires.  At 
low  temperatures,  resonant  tunneling  structures 


are  expected,  with  substantial  increases  in  the 
conductance  (approaching  the  atomic  unit  of 
conductance),  near  molecular  resonances,  suggest¬ 
ing  possible  resonant  tunneling  applications.  With 
increasing  temperature,  dephasing  processes  will 
become  important  and  one  will  eventually  find 
ohmic  behavior,  with  the  conductance  scaling 
inversely  with  length  [42].  Because  molecules  are 
substantially  smaller  than  the  usual  quantum  dot 
structures,  coulomb  blockade  and  staircase  effects 
can  be  seen  [12]  not  only  at  low  temperatures  but 
also  at  room  temperature,  suggesting  the  possibi¬ 
lity  of  single  electronics  under  ambient  conditions. 

The  work  on  molecular  wires  to  date  is  at  the 
proof  of  concepts  stage:  problems  such  as  the 
relative  resistance  of  particular  molecular  wire 
structures  [30,  31],  the  specific  attachment  of 
molecular  wires  to  electrode  surfaces,  understand¬ 
ing  the  temperature  and  length  dependence  of  the 
conductance  [33],  and  engineering  the  geometry  of 
particular  molecular  wire  interconnects  [11]  have 
been  discussed.  It  is  clear  that  use  of  molecular 
wires  as  circuit  elements  still  requires  overcoming 
several  major  obstacles,  of  which  the  most  serious 
is  probably  addressing  individual  molecular  wires, 
and  anchoring  them  in  a  specific  geometry  on 
assembled  chip  structures. 
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Ionic  Channels  in  Biological  Membranes:  Natural 
Nanotubes  Described  by  the  Drift-Diffusion  Equations 
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An  important  class  of  biological  molecules -proteins  called  ionic  channels -conduct 
ions  (like  Na+,  K  +  ,  Cl-)  through  a  narrow  tunnel  of  fixed  charge  (‘doping’).  Ionic 
channels  are  the  main  pathway  by  which  substances  move  into  cells  and  so  are  of  great 
biological  and  medical  importance:  a  substantial  fraction  of  all  drugs  used  by  physicians 
act  on  channels.  Channels  can  be  studied  in  the  tradition  of  computational  electronics. 
Drift  diffusion  equations  form  an  adequate  model  of  IV  relations  of  6  different  channel 
proteins  in  10  solutions  over  ±150  mV.  Ionic  channels  can  also  be  studied  with  the 
powerful  techniques  of  molecular  biology.  Atoms  can  be  modified  one  at  a  time  and  the 
location  of  every  atom  can  be  determined.  Ionic  channels  are  natural  nanotubes  that  can 
be  controlled  more  precisely  and  easily  than  physical  nanostructures  but  biologists  need 
help  if  realistic  simulations  are  to  be  done  atomic  detail. 


Keywords:  Biological  membranes,  ionic  channels,  drift  diffusion  equations,  Gummel  iteration 


Drift-diffusion  equations,  combined  with  Pois¬ 
son’s  equation  ( PDD ),  are  widely  used  in  physical 
sciences  to  describe  the  flux  of  charge  carriers 
through  systems  containing  fixed  charge  (doping) 
[28].  An  important  class  of  biological  molecules - 
proteins  called  ionic  channels -conduct  ions  (like 
Na  +  ,  K+,  Cl"),  and  thus  current,  through  a 
narrow  tunnel  of  fixed  charge  (‘doping’)  formed  by 
the  polar  residues  of  the  protein  [36]  although 
electron  flow  plays  no  direct  role  in  their  conduc¬ 
tion  of  current.  These  proteins  can  be  studied  with 
the  full  power  of  molecular  biology  [1];  for 
example,  they  can  be  modified  one  atom  at  a  time 


with  the  techniques  of  molecular  genetics.  Thus, 
these  natural  nanotubes  are  a  natural  ‘hole  in  the 
wall’  that  can  be  controlled  more  precisely  and 
easily  than  many  physical  nanostructures. 

Ionic  channels  open  and  close  (‘gate’)  to  give 
currents  that  are  a  random  telegraph  signal  [34]. 
The  properties  of  gating  are  complex  and  the 
structure(s)  and  mechanism(s)  that  produce  gating 
are  not  known  [23],  but  the  flow  of  ions  through 
open  channels  is  much  simpler,  and  obeys  the 
PDD  equations,  as  we  shall  see  [10-13,  19,  37]. 

Channels  are  the  main  pathway  by  which 
substances  move  in  and  out  of  cells  and  so  are  of 
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great  biological  and  medical  importance:  they  are 
responsible  for  signaling  in  the  nervous  system;  for 
coordination  of  muscle  contraction -including  the 
coordination  of  cardiac  muscle  that  allows  the 
heart  to  function  as  a  pump -and  they  are 
involved  in  transport  in  every  cell  and  organ,  for 
example,  in  the  kidney,  intestine  and  endocrine 
glands  [1,  36].  A  substantial  fraction  of  all  drugs 
used  by  physicians  act  directly  or  indirectly  on 
channels  [35]. 

Channels  are  studied  one  molecule  at  a  time  in 
hundreds,  if  not  thousands  of  laboratories  every 
day  [14-16,  31],  using  Neher  &  Sakmann’s  patch 
clamp  method  [6,  32,  34]  (for  which  they  received 
the  Nobel  Prize).  The  concentrations  of  ions 
outside  channels  (that  carry  current  through  the 
channel)  can  be  directly  controlled  and  the  shape  of 
current  voltage  (IV)  relations  can  be  manipulated. 
In  this  way,  a  wide  range  of  IV  behavior  can  be 
measured  from  the  single  doping  profile  of  one  type 
of  channel  and  so  much  can  be  inferred  about  the 
doping  profile  from  IV  measurements  (if  they  are 
taken  in  many  different  (pairs  of)  concentrations 
of  current  carriers).  For  all  these  reasons,  channels 
are  a  popular  object  for  experimentation:  thou¬ 
sands  (!)  of  abstracts  describing  their  properties  are 
presented  each  year  at  the  annual  meeting  of  the 
Biophysical  Society  and  hundreds  of  papers  are 
published  about  them,  chiefly  in  the  Biophysical 
Journal  and  the  Journal  of  Physiology  (London). 

Channels  are  also  an  appealing  and  important 
object  for  theoretical  analysis  and  numerical 
simulation.  Open  channels  are  probably  the 
simplest  protein  structures  of  general  biological 
importance.  Unlike  many  other  subjects  of  bio¬ 
physical  investigation,  ionic  channels  are  a  general 
biological  system  with  importance  for  every  organ, 
tissue,  and  cell  in  an  animal  and  plant.  Indeed, 
they  are  probably  just  as  important  for  subcelluiar 
organelles.  Ionic  channels  are  well  defined  biolo¬ 
gical  systems  that  can  be  investigated  both  with 
the  techniques  of  molecular  biology  and  of 
biophysical  chemistry,  helped  substantially  by  the 
techniques  and  insights  of  semiconductor  physics, 
I  believe. 


Ionic  movement  plays  an  important  role  in  the 
function  of  all  proteins -e.g.,  enzymes- and  so  a 
model  that  describes  ionic  movement  in  channels  is 
likely  to  give  important  insight  into  protein 
function  in  general.  Indeed,  the  closely  related 
Poisson-Boltzmann  theory  [18,24]  has  been  of 
considerable  help  already,  even  though  it  is  a 
strictly  equilibrium  theory  that  does  not  permit 
flux  at  any  time  or  location. 

Theories  of  physical  chemistry  [4]  and  electro¬ 
chemistry  [30]  certainly  should  be  able  to  predict 
the  movement  of  ions  through  a  tunnel  of  fixed 
charge -a  hole  in  the  wall -on  the  biological  time 
scale  of  100  p sec- 10 sec,  but  they  need  to  be 
supplemented  by  the  theories  and  simulations  of 
carrier  transport  in  general,  e.g.,  in  semiconduc¬ 
tors  [28].  The  physical  chemical  tradition  has  not 
often  dealt  with  flux  [2,  3,  5,  8,  9,  21,  22,  25,  29]  in 
such  a  system  of  fixed  charge,  whereas  the  flux  of 
charge  carriers  has  been  the  main  subject  of 
semiconductor  physics  and  computational  electro¬ 
nics  for  many  years,  if  not  decades. 

Five  laboratories  have  shown  that  the  PDD  of 
semiconductor  physics  form  an  adequate  model  of 
IV  relations  of  6  different  channel  proteins  in  ~10 
pairs  of  solutions  (containing  different  concentra¬ 
tions  of  the  charge  carriers  Na+,  K  +  ,  Cl-)  in  the 
range  ±150  mV  [10-13,  37].  The  IV  relations  are 
qualitatively  different  in  different  types  of  chan¬ 
nels -some  are  linear,  some  sublinear  and  some 
superlinear- because  different  channel  proteins 
have  qualitatively  different  profiles  of  fixed  charge 
arising  from  their  different  sequences  of  amino 
acids. 

The  structures  of  two  of  these  proteins  (porin 
and  its  mutant  G-119D)  are  known  from  the 
standard  methods  of  molecular  biology:  the  loca¬ 
tion  of  every  atom  has  been  determined  by  X-ray 
diffraction  [17,  26,  27,  33,  38]  with  an  accuracy  of 
~  0. 1  A.  The  mutant  has  one  extra  negative  charge. 
Measurements  of  IV  relations  from  a  single 
molecule  of  porin  [37]  allow  the  PDD  model  to 
estimate  the  additional  charge  as  -0.97  e,  although 
this  estimate  will  undoubtedly  change  as  more 
work  is  done.  (I  hasten  to  add  that  no  information 
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about  the  proteins  is  used  in  the  analysis  except  the 
length  and  diameter  of  the  channel;  parameters 
were  not  adjusted  in  any  way.) 

We  conclude  that  the  PDD  equations  seem  to  be 
an  adequate  model  open  channels.  It  is  surprising 
that  the  PDD  equations  work  as  well  as  they  do, 
given  their  evident  inadequacies.  I  imagine  they 
work  this  well  because  the  fixed  charge  density  of 
channels  is  large  (~3  x  10  21  cm-3 )  compared  to  the 
concentration  of  ions  outside  the  channel  (2xl019 
to  lx  10 21  cm-3);  because  the  biological  range  of 
voltages  is  quite  limited  (±200  mV );  and  because 
the  PDD  model  uses  effective  parameters.  Eisen- 
berg,  Chen,  and  Schuss  have  recently  shown  how 
the  PDD  equations  can  be  derived  in  single  file 
systems  like  channels  that  conduct  one  ion  at  a 
time.  The  PDD  equations,  or  equations  quite  like 
them,  describe  the  mean  properties  of  ensembles  of 
Langevin  equations,  each  of  which  specifies  the 
motion  of  a  single  ion  (of  a  particular  type  moving 
from  a  given  side  of  the  channel),  each  of  which  is 
coupled  to  its  own  reaction  field  described  by  a 
Poisson  equation  and  boundary  conditions. 

The  PDD  equations  are  just  a  first,  low  resolu¬ 
tion  description  of  open  channels.  More  realistic 
models  (using  Monte  Carlo  simulations  called 
molecular  dynamics  in  the  world  of  proteins  [7,  8, 
20])  are  needed  to  provide  insight  with  atomic 
resolution.  It  is  likely  that  many  critical  functions 
of  channels  and  enzymes  will  be  best  understood 
this  way -by  atomic  resolution  simulations  that 
include  flux-but  biologists  cannot  do  the  simula¬ 
tions  themselves:  their  simulations  of  atomic 
resolution  have  been  confined  strictly  to  equili¬ 
brium  as  have  those  of  most  chemists.  Much  help  is 
needed  from  the  community  of  computational 
electronics  if  flux,  and  electrical  potentials  at  the 
electrodes  (i.e.,  boundaries)  of  the  system,  are  to  be 
included  in  chemical  simulations  of  atomic  detail. 

Ionic  channels  are  so  important  biologically,  but 
so  well  defined  physically,  that  they  are  an  ideal 
object  of  biophysical  investigation.  The  techniques 
of  computational  electronics  and  molecular  biol¬ 
ogy  can  be  joined  together  to  determine  how  ionic 
channels  work.  Perhaps  the  same  will  prove  true  of 


many  other  chemical  and  biological  systems,  but  it 
is  wise  to  try  the  simple  ones  first.  Nothing  is  likely 
to  be  simpler  physically  than  a  hole  in  the  wall. 
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INTRODUCTION 

At  the  inception  of  NEMO  in  1993,  quantum 
transport  programs  were  in  their  infancy  as 
compared  to  the  physical  comprehensiveness  of 
semiclassical  Monte  Carlo  simulators.  The  existing 
software  only  addressed  a  limited  range  of 
quantum  devices  and  quantum  transport  theory. 
The  NEMO  project  was  conceived  to  create  a 
single  software  package  that  would  simultaneously 
include  bandstructure  effects,  self-consistent  char¬ 
ging  effects,  and  incoherent  scattering  effects,  and 
be  flexible  enough  to  model  a  wide  variety  of 
device  designs. 

Texas  Instruments  and  the  collaborating  uni¬ 
versities  evaluated  a  number  of  different  formal¬ 
isms  (see  Fig.  1)  and  chose  the  Non-equilibrium 
Green  Function  (NEGF)  approach  as  the  most 
general  and  flexible  alternative.  The  multiple 
sequential  scattering  algorithm  was  incorporated 
within  the  NEGF  formalism  and  the  other 
approaches  were  eventually  dropped.  The  theory 
[1,  2]  implemented  in  NEMO  and  simulation 
results  [3  -  5]  generated  with  NEMO  can  be  found 
in  other  publications. 

This  paper  addresses  the  software  engineering 
aspects  of  writing  NEMO  in  terms  of  the  basic 
software  design  and  some  of  the  most  important 
software  methods  we  used  to  facilitate  the  tasks  of 
both  the  programmer  and  the  user.  The  first 
section  reviews  the  fundamental  program  design. 
This  is  followed  by  a  discussion  of  the  event-driven 
methodology  used  to  construct  the  simulation 
parameters  user  interface.  The  next  section  reviews 
the  method  used  to  transfer  data  between  NEMO 
data  structures,  files,  and  the  graphical  user 
interface.  The  concluding  section  summarizes  the 
development  environment  and  software  tools  used 
to  construct  NEMO. 

FUNDAMENTAL  PROGRAM  DESIGN 

The  basic  content  of  the  NEMO  software  is  shown 
in  Figure  2  .  At  the  heart  lies  the  NEGF  formalism 


FIGURE  1  Quantum  transport  theories  pursued  during  the 
NEMO  project.  The  light  gray  background  indicates  the 
incorporation  into  the  NEGF  approach.  The  dark  background 
indicates  approaches  that  were  dropped. 


and  open  system  boundary  conditions.  The  next 
shell  lists  the  physical  models  which  may  be 
included  within  several  different  approximations. 
The  outer  shell  lists  the  software  algorithms 
and  features  that  support  the  interior  calculation 
code. 


FIGURE  2  Content  of  the  NEMO  software.  Non-equili¬ 
brium  Green  function  theory  and  novel  quantum  boundary 
conditions  are  at  the  core.  Various  physical  phenomena  can  be 
blended  into  the  theory  on  a  firm  footing.  Software  engineering 
enables  the  efficient  numerical  simulation  of  the  theory  and 
provides  user  friendly  access. 
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BATCH  VS.  GUI  PROGRAM  DESIGN 

In  its  original  form,  NEMO  was  a  command-line 
program.  This  “NEMO-Batch”  version  was  later 
augmented  with  a  Graphical  User  Interface  (GUI) 
that  provides  graphical  methods  for  entry  of  input 
parameters,  plots  calculation  results  both  during 
and  after  the  calculation,  and  allows  the  user  to 
compare  data  and  calculations  all  in  the  same 
program.  Unlike  NEMO-Batch,  which  performs 
the  simulation  without  interruption  until  comple¬ 
tion,  the  NEMO-GUI  program  allows  the  user  to 
start,  interrupt,  and  abort  calculations  at  any  time. 

Despite  the  obvious  advantages  of  the  GUI 
interface,  NEMO-Batch  was  still  very  useful.  Since 
it  excludes  all  GUI-related  functionality,  NEMO- 
Batch  uses  less  memory,  runs  faster,  and  is  easier 
to  port  to  different  computer  platforms  than 
NEMO-GUI.  Consequently,  we  devised  a  method 
to  retain  the  NEMO-Batch  code  as  the  heart  of  the 
NEMO-GUI  program.  To  completely  understand 
how  NEMO  operates,  we  must  examine  the  design 
of  both  the  batch  and  GUI  versions  of  NEMO. 

Figure  3(a)  illustrates  the  fundamental  opera¬ 
tion  of  NEMO-Batch.  The  Input  Deck  is  a  text  file 
that  contains  the  input  parameter  required  for  the 
calculation.  The  user  calls  NEMO-Batch  on  a 
command  line  using  the  filename  of  the  Input 
Deck  as  the  sole  argument.  NEMO-Batch  reads 
and  processes  the  Input  Deck,  runs  the  computa¬ 
tion,  stores  the  results  in  output  data  files,  and 
then  exits.  A  NEMO-Batch  run  can  perform  the 
calculation  for  a  single  bias  point  or  a  sequence  of 
bias  points.  While  a  current  vs.  voltage  calculation 
is  a  standard  output,  many  other  outputs  are 
available,  including  charge  density,  band  profile, 
and  transmission  coefficients. 

As  shown  in  Figure  3(b)  the  NEMO-GUI 
operation  includes  most  of  the  NEMO-Batch 
code.  This  design  requires  a  separation  of  the 
GUI  and  batch  code  into  different  modules.  The 
NEMO-GUI  compilation  includes  all  of  the 
program  modules  whether  they  are  batch  or 
GUI-related.  The  NEMO-Batch  compilation  ex¬ 


cludes  purely  GUI  modules  from  the  executable. 
There  are  some  cases  where  a  mixture  of  batch  and 
GUI  code  is  unavoidable,  particularly  in  message 
routines  and  the  functions  that  plot  and  store 
calculation  results  during  the  calculation.  For 
those  situations,  a  preprocessor-defined  flag  effec¬ 
tively  deletes  GUI-related  code  from  any  routines 
with  a  mix  of  batch-GUI  code. 

Both  NEMO-Batch  and  NEMO-GUI  use  the 
same  verification  procedures  to  check  the  validity 
of  the  input  parameters.  The  validation  routine 
determines  the  parameters  required  for  the 
requested  calculation  and  checks  for  their  pre¬ 
sence  in  the  input  deck.  If  any  material  or  model 
parameters  are  missing,  NEMO  provides  default 
values  from  its  internal  database.  In  this  manner, 
users  can  either  accept  the  default  parameters  or 
input  their  own  parameters.  This  feature  ad¬ 
dresses  the  needs  of  the  device  physicist  who 
wants  detailed  control  over  the  calculation 
process  and  the  device  engineer  who  only  wants 
to  determine  general  device  behavior  as  quickly 
as  possible. 

Even  though  they  use  the  same  validation 
routines,  NEMO-Batch  and  NEMO-GUI  have 
very  different  means  of  responding  to  validation 
errors.  For  NEMO-Batch,  the  validation  routine 
only  needs  to  check  parameters  read  from  the 
Input  Deck.  If  an  error  occurs  in  the  verification 
procedure,  NEMO-Batch  must  exit  and  report  the 
error  to  a  log  file. 

NEMO-GUI  not  only  checks  the  Input  Deck 
values,  but  also  validates  parameters  as  the  user 
enters  them  into  the  GUI.  If  an  error  occurs  for 
either  case,  NEMO-GUI  displays  an  error  message 
with  advice  on  why  the  error  occurred  and  how  to 
remedy  the  problem.  The  user  must  input  the 
correct  information  before  starting  the  calculation 
or  saving  the  input  parameters.  This  procedure 
ensures  that  an  Input  Deck  created  using  NEMO- 
GUI  will  be  valid  for  NEMO-Batch  operation. 
Consequently,  even  when  using  NEMO-Batch  for 
the  calculation,  NEMO-GUI  provides  the  most 
convenient  method  to  construct  the  input  deck. 
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FIGURE  3  Basic  design  of  NEMO  program  for  (a)  batch  and  (b)  GUI  operation.  Input  parameters  are  classified  as  device 
structure  parameters,  material  parameters,  and  model  parameters.  Device  structure  parameters  specify  the  device  geometry,  layer 
composition,  and  any  other  external  physical  parameters  such  as  the  terminal  connections  and  temperature.  Material  parameters 
include  the  band  structure  information  such  as  effective  mass,  energy  gaps,  and  band  offsets.  Model  parameters  specify  the 
theroretical  models  used  for  the  calculation  such  as  band  structure,  charge  self-consistency,  scattering  models,  simulation  domains, 


and  grids. 


SIMULATION  PARAMETER  INPUT 
METHODOLOGY 

NEMO  incorporates  a  wide  spectrum  of  models 
with  varying  tradeoffs  between  accuracy  and 


speed.  This  range  of  models  is  essential  given  the 
large  range  of  devices  we  wish  to  model  for 
different  scientific  and  engineering  applications. 

The  downside  of  this  flexibility  is  that  NEMO 
has  over  100  simulation  parameters,  many  of 
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which  interact  in  complex  ways.  In  traditional 
simulators,  the  user  must  comprehend  all  of  the 
simulation  parameters,  carefully  select  the  essen¬ 
tial  parameters  for  a  particular  study,  and  ensure 
that  all  parameter  values  are  self-consistent.  This 
can  be  a  daunting  task  for  even  the  simplest 
simulation. 

It  is  equally  difficult  for  the  programmer  to  deal 
with  for  two  reasons:  (1)  New  models  and  para¬ 
meters  added  to  the  simulator  must  be  reflected  in 
the  GUI  as  quickly  as  possible,  and  (2)  simulation 
parameters  should  only  be  defined  in  a  single 
module  accessed  by  both  the  theory  and  the  GUI 
to  ease  the  code  maintenance. 

Our  approach  to  this  problem  was  to  create  the 
GUI  interface  at  run-time  based  on  a  hierarchical 
system  of  simulation  parameters.  These  simulation 
parameters  are  defined  in  a  single  module  shared 
by  the  GUI  and  theory  code.  A  key  advantage  to 
this  method  is  that  it  insulates  changes  in  the 
simulation  parameter  set  from  changes  in  the  GUI 
and  vice  versa.  The  next  section  reviews  the 
method  used  to  create  the  run-time  GUI  interface. 

The  most  basic  level  of  the  hierarchy  directs  the 
choice  of  fundamental  models  such  as  the  poten¬ 
tial,  band  structure,  and  scattering  models.  NEMO 
determines  the  data  structures  required  by  these 
models  and  creates  a  new  GUI  interface.  The  GUI 
displays  default  choices  that  the  user  can  either 
accept  or  change. 

When  the  user  prompts  NEMO  to  accept  these 
parameters,  NEMO  tests  for  valid  user  input  and 
enforces  consistency  between  interacting  para¬ 
meters.  For  example,  if  one  parameter  value  must 
be  larger  than  another  parameter,  NEMO  forces 
adherence  to  this  rule. 

After  validating  the  selected  options,  NEMO 
determines  the  data  structures  required  by  the  next 
level  on  the  hierarchy,  creates  and  displays  a  new 
GUI  interface,  and  repeats  the  process  until  the 
parameter  set  is  sufficient  to  run  the  simulation.  If 
the  user  goes  back  to  any  portion  of  the  hierarchy 
to  select  new  options,  parameters  at  higher  levels 
of  the  hierarchy  remain  intact,  while  parameters  at 


the  lower  levels  are  added  and  subtracted  as 
appropriate. 

This  event-driven  procedure  only  presents  the 
parameters  needed  by  the  selected  models.  All 
other  parameters  are  either  hidden  or  are  disabled 
from  user  entry.  Limiting  the  parameter  display 
in  this  manner  greatly  reduces  the  sheer  number 
of  simulation  parameters  presented  to  the  user. 
Since  NEMO  provides  default  values  for  all 
simulation  parameters,  the  user  can  often  specify 
a  few  of  the  top-level  parameters  and  then 
prompt  NEMO  to  use  the  default  values  for  the 
remainder  of  the  hierarchy.  In  most  cases,  this 
approach  generates  a  reasonable  simulation. 
Nevertheless,  the  user  has  the  option  of  adjusting 
all  parameters  down  to  the  lowest  level  of  the 
hierarchy,  thus  fulfilling  a  primary  goal  of 
NEMO  to  give  the  user  full  control  over  all 
aspects  of  the  simulation. 

DATA  TRANSFER  AND  RUN-TIME 
GUI  CREATION 

The  NEMO  program  must  transfer  large  amounts 
of  information  between  data  structures,  files,  and 
the  GUI.  For  a  small  program,  it  is  sufficient  to  write 
customized  routines  for  these  operations,  but  this 
approach  became  impractical  as  NEMO  grew  in 
size.  To  resolve  this  problem,  we  designed  a  generic 
algorithm  for  transferring  information  to  and  from 
data  structures.  In  addition  to  transferring  informa¬ 
tion,  this  same  algorithm  creates  portions  of  the 
GUI  at  run-time  for  parameter  entry.  A  detailed 
description  of  this  algorithm  is  beyond  the  scope  of 
this  paper,  but  due  to  its  importance  to  NEMO  and 
its  utility  to  both  the  user  and  the  programmer,  we 
will  highlight  the  basic  features. 

We  refer  to  this  process  as  the  MemberDescrip- 
tor  algorithm.  We  defined  a  Member  Descriptor 
data  structure  that  contains  all  of  the  information 
needed  to  translate  values  between  a  data  structure 
member  and  its  destination.  The  MemberDescrip- 
tor  includes  a  character  string  identifier,  the 
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memory  location  of  a  data  structure  member 
relative  to  the  data  structure  address,  and  an 
enumerated  flag  that  defines  the  data  type  (num¬ 
ber,  character  string,  option  list,  etc.).  Each  data 
structure  has  at  least  one  associated  MemberDe- 
scriptor  array.  Each  element  of  the  MemberDe- 
scriptor  array  corresponds  to  one  of  the  data 
structure  members.  To  transfer  information  from 
the  data  structure,  the  translation  routine  uses  the 
base  address  of  the  data  structure  and  the 
MemberDescriptor  array  to  derive  the  values  of 
each  data  structure  member.  The  inverse  of  this 
procedure  transfers  values  from  either  the  GUI  or 
a  file  into  each  data  structure  member. 

For  example,  assume  we  wish  to  transfer  the 
value  of  an  integer  data  structure  member  to  a  file. 
The  translation  routine  adds  the  relative  position 
of  the  integer  data  structure  to  the  base  address  of 
the  data  structure.  To  determine  the  actual  integer 
value,  the  translator  casts  the  contents  of  the 
memory  location  to  an  integer.  The  derived  integer 
value  is  stored  in  the  file  in  the  general  format: 

<  identifier  >  =  <  value  > 

where  the  <  identifier  >  is  the  character  string 
identifier  set  in  the  MemberDescriptor  data 
structure  and  <  value  >  is  the  integer  number. 
Reversing  this  process  inputs  the  same  information 
from  the  file  and  stores  the  integer  value  into  the 
data  structure. 

This  procedure  is  used  in  many  programs  as  a 
generic  method  for  file  input  and  output  [6].  For 
NEMO,  we  extended  the  algorithm  to  transfer 
information  between  the  GUI  and  the  internal 
data  structures.  Figure  4  illustrates  how  data 
values  are  transferred  between  data  structures 
and  a  GUI  panel  consisting  of  an  option  list,  a 
text  box,  a  check  box,  and  a  graphics  plot  with 
adjustable  cursors. 

For  the  simulation  parameter  input  method 
discussed  in  the  last  section,  we  use  the  Member- 
Descriptor  algorithm  to  create  the  GUI  interface 
for  each  hierarchy.  This  dynamic  design  vastly 
enhances  the  usability  of  the  GUI  compared  to  a 


traditional  static  design  that  would  require  a  fixed 
set  of  GUI  elements  throughout  the  program 
operation. 

The  MemberDescriptor  approach  saves  a  great 
deal  of  time  and  trouble  for  the  programmer. 
Adding  a  new  data  structure  no  longer  requires 
customized  code  to  transfer  values  between  data 
structures,  files,  and  the  GUI.  Only  a  new 
MemberDescriptor  array  is  needed.  In  fact,  even 
software  team  members  who  are  totally  ignorant 
of  GUI  programming  techniques  can  effectively 
“program”  the  GUI  in  this  manner. 


FIGURE  4  Generic  data  transfer  between  NEMO  data 
structures,  the  GUI,  and  files.  The  structure  and  representations 
of  the  simulation  parameters  in  the  GUI  and  the  files  are  fully 
specified  in  the  data  structures. 
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DEVELOPMENT  ENVIRONMENT 

The  following  summarizes  key  elements  of  the 
environment  used  to  create  the  NEMO  simulator: 

Programming  Language  Most  of  the  program 
was  coded  using  ANSI  C  with  some  FORTRAN 
77  and  FORTRAN  90  code  used  in  modules  that 
required  the  greatest  speed. 

Development  Platforms  Code  development  oc¬ 
curred  on  Unix  workstations  either  using  HP 
Apollo  or  SGI  platforms.  We  also  ported  NEMO 
to  the  Sun,  IBM,  and  Linux  Unix  platforms. 

Program  Library  System  To  facilitate  code 
sharing  and  revision  updates,  we  utilized  a  standard 
program  library  system  called  RCS  (Revision 
Control  System).  The  RCS  system  manages  a 
check-out  and  check-in  system  that  only  allows 
one  programmer  to  edit  a  file  at  a  time.  The  RCS 
system  also  stores  changes  between  code  versions 
and  can  recover  any  previous  version  of  the  project. 

Graphical  User  Interface  The  NEMO  GUI  was 
written  in  Motif  which  uses  C-callable  routines  to 
create  and  control  GUI  objects  on  a  wide  variety 
of  Unix  platforms. 

GUI  Tools  The  static  elements  of  NEMO  were 
designed  using  the  XDesigner  software  tool.  The 
spreadsheet  table  and  the  2D,  3D,  and  contour 
plot  tools  were  programmed  using  the  3rd  party 
XRT  Widget  set. 

Run-time  Debugger  NEMO  relies  heavily  on 
dynamic  memory  creation  techniques  that  are 
extremely  difficult  to  debug  with  conventional 
debuggers.  The  Purify  debugger  provided  the 
means  to  find  memory  violations  incurred  at  run¬ 
time  and  greatly  simplified  the  task  of  tracking 
down  these  errors. 

Parallelization  NEMO-Batch  has  been  paral¬ 
lelized  using  MPI  (Message  Passing  Interface)  at 
the  outer  loop  of  current-voltage  characteristics. 
Individual  bias  points  are  computed  in  parallel. 

CONCLUSION 

We  have  presented  highlights  of  the  NEMO 
software  project  germane  to  construction  of  a 


large-scale  device  simulator.  It  is  hoped  that  these 
methods  will  prove  useful  to  other  groups  engaged 
in  similar  types  of  software  projects,  whether  or 
not  they  pertain  to  quantum  device  simulations. 
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The  problem  of  obtaining  the  lasing  modes  and  corresponding  threshold  conditions  for 
vertical  cavity  surface  emitting  lasers  (VCSELs)  is  formulated  as  a  frequency-dependent 
eigenvalue  problem  in  required  gain  amplitudes  and  corresponding  fields.  Both  index 
and  gain  guiding  are  treated  on  an  equal  footing.  The  complex  gain  eigenvalues  define 
necessary  but  not  sufficient  conditions  for  lasing.  The  actual  lasing  frequencies  and 
modes  that  the  VCSEL  can  support  are  then  determined  by  matching  the  gain  necessary 
for  the  optical  system  in  both  magnitude  and  phase  to  the  gain  available  from  the  laser’s 
electronic  system.  Examples  are  provided. 


Keywords:  VCSELs,  modes,  lasing,  modeling,  gain  eigenvalues 


1  INTRODUCTION 

In  the  simulation  of  edge  emitting  lasers,  the 
optical  field  can  usually  be  modeled  simply  by  a 
predetermined  set  of  Fox-Li  quasimodes  of  the 
passive  cavity  [1].  The  threshold  condition  can  then 
be  expressed  as  mode  gain  times  mode  lifetime 
equals  unity.  No  laser  previously  has  presented  the 
challenge  to  optical  simulation  now  presented  by 
vertical  cavity  surface  emitting  lasers  (VCSELs). 

The  challenge  to  VCSEL  simulation  is  not  just  a 
much  more  complicated  optical  cavity  geometry 
for  analysis  -  although  this  problem  alone  is 


significant  -  but  that  quasi-mode  analysis,  itself, 
can  no  longer  be  relied  upon.  The  mirrors  are 
distributed  and  lateral  confinement  may  be  pro¬ 
duced  by  a  combination  of  index  and  gain  guiding. 
(See,  for  example,  Ref.  [2].)  Thus  the  VCSEL 
optical  cavity  boundaries,  along  with  the  passive 
cavity  modes  and  conventional  parameters  such  as 
the  photon  lifetime,  are  often  poorly  defined. 

2  THEORY 

At  the  threshold  for  lasing,  the  optical  field  will  be 
self-supporting  in  the  presence  of  the  gain  supplied 
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by  the  laser.  Consider  an  open  VCSEL  optical 
cavity  with  tensor  electric  susceptibility  x  (?,  w)  = 
Xcav(^)  +  Xg(?>  u)  where  xg(r,a/)  is  the  necessa¬ 
rily  complex  susceptibility  representing  the  laser 
gain  inside  the  active  region  provided  under  bias, 
and  Xcav(^^)  is  the  potentially  complex  suscept¬ 
ibility  representing  the  rest  of  the  VCSEL.  Within 
the  semiclassical  approximation,  lasing  requires  [3] 

E(r:  lj) 


where  E(r,uS)^ is  the  electric  component  of  the 
lasing  field,  GC2iV(r,rf ,u>)  is  the  tensor  Green’s 
function  for  radiation  from  a  current  source  within 
the  open  VCSEL  cavity  defined  by  Xcav>  and 
juj£0Xg(r\  cj)  •  E(r! ,  lj)  acts  as  an  equivalent  cur¬ 
rent  source. 

If  the  spatial  distribution  of  the  gain  is  known  to 
good  approximation,  then  Eq.  (1)  reduces  to  an 
eigenvalue  problem  in  the  necessary  gain  ampli¬ 
tudes  for  lasing  and  the  corresponding  lasing 
fields.  That  is,  if  x^(r,o;)  =  /c(a;)^(?)  where 
Xg  (r )  is  known,  then  Eq.  (1)  gives 

£(r,  u)  -  g(uj)E(r,  u),  (2) 

where  Q(uj)  is  an  integral  operator  defined  by 
GE(r)  =jue0 1  dr1  Gcav(F,  F')  •  [xf(F')  •  E(r% 

(3) 

Note  that  this  eigenvalue  problem  need^nly  be 
evaluated  over  regions  for  which  xg  (?)  is 
significant,  that  is,  over  the  active  region  of  the 
laser.  The  solutions  to  this  eigenvalue  problem  will 
be  complex  gain  eigenvalues  and  the  correspond¬ 
ing  fields  as  a  continuous  function  of  frequency,  in 
contrast  to  the  more  familiar  case  of  finding  the 
discrete  real  frequency  eigenvalues  and  modes  of  a 
closed  passive  cavity. 

It  is  also  possible  to  repartition  the  VCSEL 
susceptibility  to  take  part  of  xCav  out  °f  the 


/ 


drfGQw(r,rf  ,u>) 


\jue0Xg{rf ,J)  •  E(r' ,w)], 


(1) 


Green’s  function  and  put  it  in  Eq.  (2)  as  a  second 
source  term.  This  may  assist  in  the  solution  of 
problems  for  which  the  Green’s  function  is  not 
easily  obtained.  Extending  the  formulation  in  this 
way  changes  the  ordinary  eigenvalue  problem  of 
Eq.  (2)  into  a  generalized  eigenvalue  problem. 

The  solutions  to  Eq.  (2)  establish  necessary  but 
not  sufficient  conditions  for  the  VCSEL  to  lase. 
The  frequencies  at  which  the  VCSEL  can  actually 
lase  are  those  for  which  the  complex  gain 
susceptibility  necessary  for  the  optical  system  can 
be  matched  to  the  complex  gain  susceptibility 
available  from  the  laser  electronic  system  in  both 
magnitude  and  phase. 


3  EXAMPLE 

To  illustrate  the  use  of  this  formulation,  lasing  in 
the  VCSEL  cavity  diagrammed  in  Figure  1  was 
considered  [2].  This  cavity  allows  for  relatively 
easy  analysis  while  allowing  much  of  the  essential 
physics  of  VCSELs  to  be  modeled.  Xcav(?,^)was 
taken  as  that  of  the  planarly  layered  structure 
including  the  nominally  lossy  quantum  well  layer, 
but  with  no  lateral  confinment.  Note,  for  simpli¬ 
city,  all  susceptibilities  were  treated  as  scalars.  The 
spatial  distribution  of  the  gain  susceptibility  was 
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FIGURE  1  Model  VCSEL  cavity.  Ao  =  980nm. 
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approximated  as  uniform  inside  the  active  region 
and  zero  elsewhere.  Thus  for  later  convenience  we 
set  x®(r)  =  — 7  within  the  9jimxl0  pmx3nm 
quantum  well  active  region,  and  xf\?)  =  0  else- 
where. 

The  Green’s  functions  for  such  a  planarly 
layered  structure  are  well  known  [4],  and  only 
those  with  both  source  and  field  coordinates  within 
the  thin  quantum  well  active  layer  were  required. 
The  Green’s  functions  were  obtained  in  Fourier- 
space  where  the  only  portion  of  the  calculation  that 
requires  a  computer  is  the  calculation  of  the 
distributed-Bragg-reflector  (DBR)  reflectivities  vs. 
frequency,  incident  angle  and  polarization,  which 
may  be  performed  using  several  equivalent  meth¬ 
ods  [5,  6].  Of  course,  calculation  of  the  Green’s 
function  for  more  complex  cavity  structures  could 
require  computationally  intensive  numerical  meth¬ 
ods,  while  taking  advantage  of  the  cylindrical 
symmetry  in  many  VCSELs  could  significantly 
reduce  the  computational  load.  However,  while  of 
obvious  practical  importance,  how  the  Green’s 
function  is  obtained  or  what  coordinate  system  is 
used  is  of  no  conceptual  importance  in  this 
formulation.  Note  that  the  potentially  difficult 
problem  of  finding  the  Green’s  function  would 
only  need  to  be  solved  once  for  all  bias  conditions, 
as  the  Green’s  functions  are  independent  of  bias  as 
long  as  effects  such  as  thermal  expansion  of  the 
cavity  are  ignored. 

Once  the  Green’s  function  was  calculated,  the 
gain  eigenvalue  problem  of  Eq.  (4)  was  discretized 
and  solved.  Using  a  moment  method,  the  field, 
approximated  as  constant  over  the  width  of  the 
well,  was  expanded  in  rectangular  testing  functions 
with  unknown  complex  coefficients  in  the  (x—y) 
plane  of  the  well,  i.e. 


E{x,y) 


(4) 


where  rect[(r?  —  r}0)/d]  is  a  rectangular  pulse  of 
amplitude  one  and  full  width  d  centered  at  r)0,  and 


the  discrete  variable  ft  =  (nx,  ny)  labels  the  grid  sites. 
(For  notational  convenience,  the  various  depen¬ 
dencies  on  frequency  are  no  longer  indicated 
explicitly.)  Then  the  inner  products  of  both  sides 
of  Eq.  (4)  were  taken  with  the  same  rectangular 
testing  functions  (Galerkin’s  method  [7]).  The  result 
was  a  finite  set  of  linear  equations  for  the  unknown 
field  coefficients  E(n)  of  the  same  form  as  Eq.  (2), 

VMoXf  (« ) 2cav («, «')  •  E(n')  =”£(«).  (5) 

The  corresponding  discretized  Green’s  function 
was  obtained  from  the  continuous  Green’s  func¬ 
tion  'in  Fourier-space  by 


.(»>-')  =  // 


dk  dk  eJkxdx  ~n>x )  ejkydy  ^  ~n? ) 

x  y  ^ 


4sin2(kxdx/2)sin2(kydy/2)^  fu  ,  ^ 

X  ^k\k)dxdy  G^kx,ky)  (6) 


for  these  rectangular  testing  functions.  For  this 
work,  a  commercially  available  (IMSL)  routine  for 
generalized  complex  eigenvalue  problems  was  used 
to  solve  Eq.  (5). 

Figure  (2)  shows  the  first  three  gain  eigenvalues 
k(o>)  (two  are  essentially  degenerate)  plotted  at 


FIGURE  2  First  three  complex  gain  eigenvalues  at  discrete 
intervals  in  frequency;  two  are  essentially  degenerate.  Lasing  is 
possible  only  where  the  eigenvalue  curves  crosses  the  real  axis, 
with  the  values  of  the  gain  eigenvalues  at  these  crossings 
defining  the  threshold  condition  for  lasing. 
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constant  intervals  in  frequency.  Approximating 
the  gain  susceptibility  available  from  the  electronic 
system  of  the  laser  under  bias  as  purely  imaginary, 
lasing  is  only  possible  where  one  of  the  gain 
eigenvalue  curves  crosses  the  real  axis,  such  that 
Xg(r,u;)  =  K(cj)xf\r)  is  also  purely  imaginary. 
The  gain  eigenvalue  at  the  crossing  frequency 
defines  the  threshold  gain  susceptibility.  Approx¬ 
imating  the  gain  susceptibility  as  purely  imaginary 
is  done  here  for  illustrative  purposes  only.  In 
reality  the  actual  lasing  modes  of  the  system  would 
be  determined  by  the  intersection  of  the  gain 
eigenvalues  with  a  curve  describing  the  Kramers- 
Kronig  relation  between  the  real  and  imaginary 
parts  of  the  gain  susceptibility.  The  large  area 
active  region  considered  in  this  example  produces 
a  very  tight  spacing  of  gain  eigenvalues  in 
frequency,  but  the  gain  amplitudes  required  for 
lasing  are  significantly  different  because  of  the 
different  overlaps  of  the  self-consistently  calcu¬ 
lated  lasing  modes  with  the  active  region.  Figure 
(3)  shows  the  field  patterns  of  the  first  and  third 
lasing  modes  inside  the  gain  region. 


4  CONCLUSION 

The  problem  of  obtaining  the  lasing  modes  and 
corresponding  threshold  conditions  for  VCSELs 
has  been  formulated  as  a  frequency-dependent 
eigenvalue  problem  in  required  gain  amplitudes 
and  corresponding  fields.  Index  and  gain  guiding 
are  treated  on  an  equal  footing.  The  complex  gain 
eigenvalues  define  necessary  but  not  sufficient 
conditions  for  lasing.  The  actual  lasing  frequencies 
and  modes  that  the  VCSEL  can  support  are  then 
determined  by  matching  the  gain  necessary  for  the 
optical  system  in  both  magnitude  and  phase  to 
that  available  from  the  laser  electronic  system. 
With  this  formulation  the  lasing  modes  and 
corresponding  threshold  conditions  are  well  de¬ 
fined  even  when  the  optical  cavity  boundaries  and 
conventional  cavity  parameters  such  as  the  photon 
lifetime  are  not. 
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FIGURE  3  Lasing  fields  at  (a)  A  ~  1.0003  \0  and  (b)  A  ~ 
1.0001  X0  as  a  function  of  position  within  the  gain  region. 


This  formulation  also  has  the  practical  advan¬ 
tage  that  the  problem  of  obtaining  the  self- 
supporting  lasing  fields  of  a  VCSEL  cavity  is 
separated  into  two  distinct  and  already  thoroughly 
studied  problems:  that  of  finding  the  optical 
Green’s  functions  for  a  fixed  radiation  source  in 
an  open  cavity  and  that  of  solving  a  complex 
eigenvalue  problem.  Thus,  this  formulation  pro¬ 
vides  a  framework  for  the  application  of  a  large 
preexisting  knowledge  base  to  the  relatively  new 
challenge  of  modeling  VCSELs.  For  example,  in 
the  specific  implementation  used  for  this  work,  the 
cavity  Green’s  function  was  obtained  by  textbook 
methods  [4-6]  and  the  generalized  complex 
eigenvalue  problem  was  solved  using  a  commer¬ 
cially  available  numerical  routine. 
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Employing  a  recently  developed  efficient  cellular  automaton  technique  for  solving 
Boltzmann’s  transport  equation  for  realistic  devices,  we  present  a  detailed  study  of  the 
carrier  dynamics  in  GaAs  avalanche  p-i-n  (IMPATT)  diodes.  We  find  that  the  impact 
ionization  in  reverse  bias  p-i-n  diodes  with  ultrathin  (less  than  50  nm)  intrinsic  regions  is 
triggered  by  Zener  tunneling  rather  than  by  thermal  generation.  The  impact  generation 
of  hot  carriers  occurs  mainly  in  the  low-field  junction  regions  rather  than  in  the  high 
field  intrinsic  zone.  The  calculations  predict  significantly  more  minority  carriers  on  the 
«-side  than  on  the  p- side. 


Keywords:  Cellular  Automata,  IMPATT  diodes,  zener  tunneling,  impact  ionization,  avalanche 
breakdown 


1.  INTRODUCTION 

The  carrier  dynamics  in  modern  nanometer  size 
devices  is  dominated  by  hot  carrier  effects  and 
nonlocal  transport  phenomena  such  as  tunneling 
and  impact  ionization.  A  realistic  prediction  and 
understanding  of  these  effects  in  real  devices  that 
operate  at  room  temperature  requires  at  least  the 
solution  of  the  full  semiclassical  Boltzmann 
equation.  A  few  years  ago,  we  have  developed  a 
novel  method,  the  Cellular  Automaton  (CA) 


approach  namely,  that  provides  a  computationally 
efficient  scheme  for  solving  Boltzmann’s  equation 
in  position  and  momentum  phase  space  [1-2]. 
Since  then,  we  have  significantly  refined  this 
approach  [3-4]  and  enhanced  its  speed  and 
robustness;  a  detailed  review  of  the  present  status 
of  the  CA  method  has  been  given  very  recently  [5]. 
The  present  paper  focuses  on  the  application  of 
this  scheme  to  a  physically  intriguing  situation 
where  the  carrier  dynamics  is  highly  complex  and 
very  far  from  equilibrium.  We  present  a  micro- 
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scopic  analysis  of  the  carrier  dynamics  near 
avalanche  breakdown  in  GaAs  and  AlGaAs 
IMPATT  diodes. 


2.  THE  CA  METHOD 

We  briefly  summarize  the  highlights  of  the  CA 
method  [1-5].  A  cellular  automaton  consists  of  a 
lattice  with  a  finite  number  of  states  attached  to 
each  lattice  site  that  can  be  interpreted  as 
pseudoparticles.  Their  dynamics  is  governed  by 
a  limited  set  of  local  transition  rules.  The  perhaps 
most  crucial  factor  that  sets  cellular  automata 
apart  from  standard  finite  differencing  methods  is 
the  reduction  of  all  physical  variables  to  a  finite 
set  of  discrete  values.  In  contrast  to  finite 
differencing  methods,  the  size  of  this  set  can  be 
kept  relatively  small  due  to  an  optimized  phase 
space  discretization.  This  discretization  utilizes 
the  fact  that  the  final  carrier  distribution  function 
changes  on  a  much  coarser  scale  in  phase  space 
than  typical  changes  of  phase  space  variables 
within  on  time  step.  In  our  present  implementa¬ 
tion,  we  use  a  two-dimensional  hexagonal  lattice 
in  real  space.  Attached  to  each  lattice  site  are  of 
the  order  of  105  momentum  states  that  form  a 
hexagonal  closed  packed  structure.  Transition 
rules  among  these  states  represent  the  quantum 
mechanical  collisions  as  well  as  the  drift  and 
diffusion  terms  in  the  Boltzmann  equation.  In  the 
simulation,  the  particle  dynamics  consists  mainly 
in  look-up  operations  of  pre-calculated  and 
hierarchical  scattering  tables.  This  results  in  a 
numerically  very  efficient  algorithm  for  the 
particle  dynamics  that  is  typically  faster  than 
the  standard  Monte  Carlo  scheme  by  a  factor  of 
30-50. 

From  a  physics  point  of  view,  in  particular 
concerning  scattering  mechanisms,  the  CA  and 
Monte  Carlo  scheme  are  equivalent.  Indeed,  we 
have  employed  both  methods  in  obtaining  the 
results  of  this  paper,  mostly  to  check  the  accuracy 
and  consistency  of  the  CA  results. 


3.  CARRIER  DYNAMICS  IN  IMPATT 
DIODES 

Recently,  GaAs  and  GaAlAs  based  IMPact 
Avalanche  Transit  Time  (IMPATT)  diodes  have 
been  fabricated  with  a  high  power  output  at 
frequencies  up  to  200  GHz  [6-8]  .  The  avalanche 
zone  of  such  a  diode  consists  of  a  p-i-n  diode  with 
highly  doped  n  and  p  regions  and  an  ultrathin 
intrinsic  zone  of  20  to  50  nm.  The  build-up  of  the 
carrier  avalanche  under  high  reverse  bias  is  a 
subtle  interplay  between  thermal  generation,  inter¬ 
band  (Zener)  tunneling,  impact  ionization,  and 
other  scattering  mechanisms  [9]. 


3.1.  Scattering  Rates 

The  present  simulations  incorporate  all  relevant 
standard  scattering  mechanisms  for  carriers,  such 
as  ionized  impurity,  plasmon,  intra-  and  inter¬ 
valley  phonon  scattering,  alloy  scattering,  and 
thermal  generation,  and  nonparabolic  electron  and 
hole  bands  [10].  Impact  ionization  is  accounted  for 
by  invoking  the  model  of  Kane  [11],  using  density 
of  states  that  have  been  calculated  with  the 
empirical  pseudopotential  method.  The  k*p  model 
of  Krieger  [12]  is  employed  to  calculate  the 
interband  tunneling  rate  for  direct  transitions  as 
a  function  of  electric  field.  The  absolute  magnitude 
of  the  impact  ionization  rate  of  electrons  and  holes 
is  chosen  as  to  reproduce  the  experimental 
ionization  coefficients  in  bulk  GaAs  and  AlGaAs 
[13-15]. 

In  Figures  1  and  2,  we  show  several  scattering 
rates  for  electrons  and  holes  in  GaAs.  This  figure 
reveals  that  the  impact  ionization  scattering  rate 
for  holes  is  markedly  lower  than  for  electrons.  An 
interesting  consequence  of  this  difference  is  illu¬ 
strated  in  Figure  2.  It  shows  the  average  distance 
an  electron  or  hole  travels  in  bulk  GaAs  between 
impact  ionization  scattering  events  when  the  field 
is  1  MV/cm.  In  contrast  to  the  quasi-ballistically 
moving  electrons,  the  slower  holes  suffer  many 
other  scattering  processes  before  they  are  able  to 
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FIGURE  1  Impact  ionization  scattering  rates  (labeled  by 
“impact”)  and  total  scattering  rates  (“total”),  in  units  of  1/fs, 
for  electrons  in  the  T  valley  and  heavy  holes,  respectively,  in 
bulk  GaAs  at  room  temperature  and  a  doping  level  of 
2xl018cm-3  as  a  function  of  energy  in  eV. 


FIGURE  2  Probability  per  distance  for  an  electron  (respec¬ 
tively,  hole)  in  bulk  GaAs  to  initiate  an  impact  ionization  along 
its  trajectory  of  a  length  specified  by  the  abscissa  in  nm.  The 
electric  field  is  1  MV/cm. 

impact  ionize.  This  leads  to  the  broad  distribution 
of  hole  trajectories  in  Figure  2. 


3.2.  The  Avalanche  Generation  Process: 

Qualitative  Picture 

We  now  consider  a  GaAs  p-i-n  diode  with  a  20  nm 
intrinsic  zone  and  a  carrier  concentration  of 
A2=^  =  2xl018cm-3.  The  density  in  the  intrinsic 
zone  is  set  to  n  =  2x  1015cmT3.  In  Figure  3,  we 
illustrate  schematically  the  time  development  of 


FIGURE  3  Schematic  picture  of  carrier  dynamics  in  a  highly 
doped  IMPATT  diode  with  ultrathin  /-zones  under  reverse  bias 
that  is  close  to  breakdown.  The  relevant  dynamical  processes 
are  Zener  tunneling,  labeled  by  (1),  impact  ionization  in  the 
depletion  zone  (labeled  by  (2)  and  only  shown  on  the  w-side  for 
simplicity),  and  impact  ionization  in  the  high  field  region, 
labeled  by  (3). 


the  carrier  distribution  in  the  diode  after  applying 
a  reverse  bias  of  5.5  eV. 

For  a  reverse  bias  between  4  and  5  V,  the 
resulting  high  field  of  approximately  1  MV/cm 
initiates  electron  hole  pair  generation  by  interband 
tunneling  (step  (1)  in  Fig.  3).  The  thermal 
generation  rate  is  found  to  be  negligible  compared 
to  the  tunneling  rate  for  this  device  geometry.  The 
generated  electrons  on  the  n-side  get  rapidly 
accelerated  by  the  electric  field,  move  away  from 
the  band  edge  and  gain  a  significant  amount  of 
energy.  The  key  point  is  that  the  electrons  dissipate 
their  excess  energy  efficiently  by  impact  ionization 
within  the  low  field  depletion  region  rather  than  in 
the  high  field  zone  (step  (2)).  The  holes  that  are 
generated  via  these  impact  ionization  processes  get 
accelerated  by  the  field  back  into  and  through  the 
whole  intrinsic  zone.  This  allows  the  holes  to  gain 
sufficient  excess  energy  so  that  they  induce  impact 
ionization  processes  already  within  the  i- zone.  This 
ignites  the  avalanche  process  (step  (3)). 
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Thus,  the  carrier  generation  in  GaAs  IMPATT 
diodes  with  ultrathin  /-zones  is  triggered  by  Zener 
tunneling  in  the  high  field  region,  whereas  the 
generation  process  itself  starts  within  the  depletion 
layers.  This  leads  to  a  pronounced  dark-space 
effect  in  the  impact  ionization  rates.  For  applied 
voltages  below  5.7  V,  this  carrier  generation  cycle 
does  not  lead  to  breakdown.  However,  it  takes 
almost  30  ps  after  switch-on  before  the  current  is 
fully  stationary  when  the  applied  bias  is  close  to 
this  breakdown  threshold  value. 

Since  the  impact  ionization  rate  and  the  drift 
velocity  of  electrons  is  higher  than  that  of  holes, 
the  electrons  get  more  efficiently  cooled  on  the  n- 
side  of  the  intrinsic  zone  than  the  holes  do  on  the 
opposite  side.  This  leads  to  a  slightly  higher 
average  energy  of  the  holes,  as  well  as  to  a  higher 
density  of  holes  on  the  p- side  than  electrons  on  the 
n-side.  In  the  avalanche  regime,  the  higher  hole 
density  effectively  screens  the  electric  field  in  the 
intrinsic  zone.  This  leads  to  an  interesting  effect  for 
short  /-zones  of  less  than  50  nm.  Once  the  reverse 
bias  exceeds  the  threshold  value  for  avalanche 
multiplication,  the  holes  are  able  to  flood  the 
whole  /-zone.  This  screens  the  electric  field  and 
effectively  impedes  the  avalanche  breakdown. 

33.  Quantitative  Results  and  Comparison 
with  Data 

To  substantiate  this  qualitative  picture,  Figures 
4(a) -4(d)  characterize  the  carrier  distribution  for 
a  reverse  bias  of  5.5  V  in  the  20  nm p-i-n  diode  (i.e., 
just  below  breakdown)  quantitatively.  The  spatial 
field  profile  is  drawn  as  grey  inset  in  these  figures  in 
order  to  show  the  spatial  extent  of  the  depletion 
zone.  The  electric  field  is  constant  within  the 
intrinsic  zone  and  amounts  to  1.1  MV/cm. 

The  energy  of  electrons  (Fig.  4(a))  reaches  a 
maximum  at  the  center  of  the  high  field  zone. 
There,  the  electrons  loose  energy  predominantly 
by  impact  ionization.  Electrons  are  able  to  fly  a 
longer  distance  than  holes  before  they  loose  energy 
and  pick  up  a  given  amount  of  energy  on  a  shorter 
trajectory.  Therefore,  the  average  electron  energy 
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FIGURE  4  The  figure  refers  to  a  20  nm  PIN  diode  with  a 
doping  level  of  2xl018cm~3  in  the  n-  and  /^-regions,  respec¬ 
tively,  and  an  applied  reverse  bias  of  5.5  V.  The  shape  of  the 
electric  field  is  shown  in  grey  to  indicate  the  width  of  the 
depletion  zone.  Its  magnitude  is  1.1  MV/cm.  (a)  Calculated 
average  energy  of  electrons  and  holes  as  a  function  of  position, 
(b)  Calculated  energy  loss  rate  of  electrons  and  holes.  Shown 
are  the  total  energy  losses  and  the  contributions  from  impact 
ionization,  (c)  Calculated  impact  ionization  generation  rate  of 
electron  hole  pairs  induced  by  electrons  and  holes,  respectively 
(full  lines).  Zener  generation  rate  of  electrons  and  holes, 
respectively  (dashed  lines),  (d)  Calculated  density  of  electrons 
and  holes. 


decreases  more  slowly  and  extends  farther  into  the 
depletion  region  on  the  n- side  than  the  hole  energy 
does  on  the  /?-side. 

The  same  effect  can  be  deduced  from  the 
spatially  resolved  energy  loss  (Fig.  4(b)).  The 
energy  loss  by  impact  ionization  is  higher  for 
electrons  than  for  holes  and  remains  large 
throughout  the  depletion  region  on  the  w-side. 
Since  electrons  gain  energy  more  rapidly  than 
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holes,  they  are  more  likely  to  impact  ionize 
repeatedly.  The  slower  holes,  on  the  other  hand, 
remain  hotter  which  leads  to  their  higher  total 
energy  loss.  Particularly  near  the  end  of  the  /7-sided 
depletion  region,  their  energy  loss  is  dominated  by 
phonon  and  plasmon  scattering.  These  results  are 
consistent  with  the  higher  integral  of  the  electron 
induced  electron-hole  pair  generation  rate  that  is 
shown  in  Figure  4(c).  This  figure  also  includes  the 
number  of  generated  electrons  and  holes  by  Zener 
tunneling.  The  latter  generation  process  gives  a 
negligible  contribution  to  the  current  within  0.2  V 
of  breakdown. 

The  higher  electron-induced  impact  ionization 
rate  generates  more  holes  near  the  i-n  junction 
than  the  opposite  process  generates  electrons  near 
the  p-i  region.  Therefore,  the  density  of  minority 
carriers  is  higher  on  the  «-side  than  on  the  /7-side 
(Fig.  4(d)).  This  effect  is  enhanced  by  the  higher 
mobility  of  electrons  that  causes  them  to  diffuse 
out  of  the  p-zone  more  rapidly.  The  asymmetry  of 
carrier  densities  within  the  high  field  region  that 
can  be  seen  in  Figure  4(d)  is  another  consequence 
of  the  higher  impact  scattering  rate  for  electrons. 
The  holes  that  are  generated  by  impact  ionization 
on  the  «-side  get  accelerated  through  the  high  field 
region  towards  the  /7-zone,  leading  to  the  excess 
density  of  holes  on  the  /7-side. 

Figure  5  compares  the  present  calculations  with 
the  measured  current-voltage  characteristics  for  a 
20  nm  and  50  nm  pin  diode,  respectively.  The 
agreement  is  seen  to  be  very  good.  In  Figure  6,  we 
predict  the  I-V  characteristics  of  30  nm  AlGaAs  p- 
i-n  structures.  Since  the  energy  gap  is  higher  than 
in  GaAs,  the  impact  ionization  rate  is  lower  which 
causes  the  breakdown  voltage  to  increase  and  the 
current  density  to  decrease  with  increasing  A1 
concentration.  The  lower  breakdown  voltage 
compared  to  the  20  nm  diode  discussed  above  is 
caused  by  the  higher  doping  level  of  the  n  and  p- 
regions. 

The  work  has  been  partially  supported  by 
SIEMENS  and  by  the  Deutsche  Forschungsge- 
meinschaft  (SFB  384). 


FIGURE  5  Calculated  (full  curves)  and  experimental  current 
densities  (stars),  in  A/cm2,  versus  applied  reverse  bias  in  V  for  a 
20  nm  and  50  nm  GaAs  p-i-n  diode.  The  doping  concentration 
in  the  n  and  /Mayers  amounts  to  2x10 18  cm”3.  The  experi¬ 
mental  data  are  from  [6]. 


Reverse  bias  [V] 

FIGURE  6  Predicted  current  densities  versus  applied  reverse 
bias  for  a  30  nm  GaAs  and  Gao.7Alo.3As  p-i-n  diode.  The 
doping  concentration  in  the  n  and  /7-layers  is  5xl018cm”3.  The 
dotted  lines  show  the  Zener  tunneling  contribution  to 
the  current  density. 

References 

[1]  Kometer,  K.,  Zandler,  G.  and  Vogl,  P.  (1992).  “Lattice- 
gas  cellular-automaton  method  for  semiclassical  transport 
in  semiconductors”,  Phys.  Rev.  B .,  46,  1382-1394. 

[2]  Zandler,  G.,  Di  Carlo,  A.,  Kometer,  K.,  Lugli,  P.,  Vogl,  P. 
and  Gornik,  E.  (1993).  “A  comparison  of  Monte  Carlo  and 
cellular  automaton  approaches  for  semiconductor  device 
simulations”,  IEEE  Electron  Dev.  Letters ,  14,  77-79. 

[3]  Rein,  A.,  Zandler,  G.,  Saraniti,  M.,  Lugli,  P.  and  Vogl,  P. 
(1994).  “Cellular  Automaton  Simulations  of  Planar 
Doped  Barrier  Field  Effect  Transistor”,  Proceedings  of 
the  3rd  International  Workshop  on  Computational  Electro¬ 
nics  (S.  M.  Goodnick,  ed.),  7-10,  Oregon  State  University 
Press,  Corvallis. 


98 


G.  ZANDLER  et  al. 


[4]  Zandler,  G.,  Saraniti,  M.,  Rein,  A.  and  Voglj  P.  (1996). 
“Cellular  Automata  for  Device  Simulation  E  Concepts 
and  Applications”,  Proceedings  of  1996  Int .  Conf  on 
Simulation  of  Semiconductor  Processes  and  Devices ,  39- 
42,  (Business  Center  for  Academic  Societies  Japan), 
Tokyo,  ISBN  0-7803-2745-4. 

[5]  Vogl,  P.,  Zandler,  G.,  Rein,  A.  and  Saraniti,  M.  (1997). 
“Cellular  Automaton  approach  for  semiconductor  trans¬ 
port”,  in  Theory  of  Transport  Properties  of  Semiconductor 
Nanostructures  (E.  Schoell,  ed.),  Chapman  and  Hall, 
London,  in  print. 

[6]  Gaul,  L.,  Huber,  S.,  Freyer,  J.  and  Claassen,  M.  (1991). 
“Determination  of  tunnel-generation  rate  from  GaAs 
PIN-structures”,  Solid  State  Electron .,  34,  723-726. 

[7]  Poebl,  M.,  Bogner,  W.  and  Gaul,  L.  (1994).  “CW  GaAs 
MIT  ATT  Source  on  Copper  Heatsink  up  to  160  GHz”, 
Electronic  Letters ,  30,  1316-1317. 

[8]  Tschernitz,  M.  and  Freyer,  J.  (1995).  “140  GHz  GaAs 
Double-Read  IMPATT  Diodes”,  Electronic  Letters ,  31, 
582-583. 

[9]  Liebig,  D.,  Lugli,  P.,  Vogl,  P.,  Claasen,  M.  and  Harth,  W. 
(1992).  “Tunneling  and  ionization  phenomena  in  GaAs 
PIN  diodes”,  Microelectronic  Engineering ,  19,  127-130. 

[10]  Canali,  C.,  Pavan,  P.,  Di  Carlo,  P.,  Lugli,  P.,  Malik,  R., 
Manfredi,  M.,  Neviani,  A.,  Vendrame,  E.,  Zanoni,  E.  and 
Zandler,  G.  (1996).  “Experimental  and  Monte  Carlo 
analysis  of  impact-ionization  in  AIGaAs/GaAs  HBT’s”, 
IEEE  Trans,  on  Elec .  Dev.,  43,  1769-1777. 

[11]  Kane,  E.  O.  (1967).  “Electron  scattering  by  pair  produc¬ 
tion  in  silicon”,  Phys.  Rev.,  159,  624-631. 

[12]  Krieger,  J.  B.  (1996).  “Theory  of  tunneling  in  semiconduc¬ 
tors  with  dgenerate  band  structure”,  Ann.  Phys.,  36,  1  -60. 

[13]  Ando,  H.  and  Kanbe,  H.  (1981).  “Ionisations  coefficient 
measurements  in  GaAs  by  using  multiplication  noise 
characteristics”.  Solid  State  Electron.,  24,  629-634. 

[14]  Bulman,  G.  E.,  Robbins,  V.  M.,  Brennan,  K.  F.,  Hess,  K. 
and  Stillman,  G.  E.  (1983).  “Experimental  determination 


of  Impact  ionization  coefficients  in  (100)  GaAs”,  IEEE 
Electron  Dev.  Lett.,  4,  181-185. 

[15]  Robbins,  V.  M.,  Smith,  S.  C.  and  Stillman,  G.  E.  (1988). 
“Impact  ionization  in  AlxGai  _x  As  for  x=0.1  -0.4”,  Appl. 
Phys.  Lett.,  52,  296-298. 

Authors’  Biographies 

Gunther  Zandler  is  research  associate  at  the  Walter 
Schottky  Institute  of  the  Technical  University  of 
Munich  and  deals  with  high  field  transport  and 
ultrafast  phenomena. 

Ralph  Oberhuber  is  doing  his  Ph.D.  thesis  at  the 
Technical  University  in  Munich  on  new  device 
concepts. 

Dennis  Liebig  is  doing  his  Ph.D.  thesis  at  the 
Technical  University  in  Hamburg-Harburg  on 
device  modeling. 

Peter  Vogl  is  professor  of  physics  at  the 
Technical  University  of  Munich.  He  is  interested 
in  the  theory  of  electronic  structure  and  carrier 
transport  in  solids. 

Marco  Saraniti  is  post-doctoral  fellow  at  ASU 
in  Tempe.  He  focuses  on  numerical  aspects  of  the 
simulation  of  electron  devices. 

Paolo  Lugli  is  professor  of  optoelectronics  at  the 
2nd  University  of  Rome  and  focuses  on  semicon¬ 
ductor  device  modeling  and  ultrafast  phenomena. 


VLSI  DESIGN 

1998,  Vol.  8,  Nos.  (1-4),  pp.  99-103 
Reprints  available  directly  from  the  publisher 
Photocopying  permitted  by  license  only 


©  1998  OP  A  (Overseas  Publishers  Association)  N.V. 

Published  by  license  under 
the  Gordon  and  Breach  Science 
Publishers  imprint. 
Printed  in  India. 


3D  Parallel  Finite  Element  Simulation  of  In-Cell 
Breakdown  in  Lateral-Channel  IGBTs 
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of  Glasgow,  Glasgow  G12  8LT,  Scotland,  UK 


In  this  paper  we  describe  the  use  of  3D  parallel  finite  element  solution  of  the  Poisson 
equation  to  calculate  the  in-cell  breakdown  voltage  in  lateral  channel  IGBTs.  The  solver 
is  based  on  topologically  rectangular  grids,  and  uses  a  domain  decomposition  approach 
to  partition  the  problem  on  an  array  of  mesh  connected  processors.  A  parallel 
BiCGSTAB  solver  has  been  developed  to  solve  the  Possion  equation.  Hole  and  electron 
ionisation  integrals  are  calculated  to  determine  the  breakdown  voltage.  The  effects  of 
varying  the  doping  concentration  in  the  n~  base  region  and  stopper  surface 
concentration  are  investigated. 

Keywords:  3D  simulation,  parallel,  finite  element,  breakdown,  IGBT 


INTRODUCTION 

Insulated  Gate  Bipolar  Transistors  (IGBTs)  [1]  are 
among  the  leading  devices  in  the  power  semicon¬ 
ductor  market,  combining  a  low  on-state  voltage 
typical  of  bipolar  devices,  with  MOS  gated  switch¬ 
ing.  Modem  IGBTs  have  a  cellular  structure  such 
as  that  shown  in  Figure  1.  The  optimisation  of  the 
cell  design  is  an  ongoing  issue.  The  current  density, 
for  example,  can  be  increased  by  reducing  the  cell- 
to-cell  separation.  Unfortunately  in  lateral  channel 
IGBTs  the  minimum  cell-to-cell  separation  is 
restricted  by  cell-to-cell  pinch-off  which  limits  the 
current  density.  Although  vertical-channel  non 
pinch-off  devices  have  been  demonstrated  [2]  the 


lateral  channel  IGBTs  with  stopper  implantation 
are  still  the  preferred  technological  choice.  The 
stopper,  however,  should  not  compromise  the 
overall  device  breakdown.  The  calculation  of  the 
in-cell  breakdown  in  the  stopper  design  process 
requires  an  accurate  3D  solution  of  the  Poisson 
equation  [3]  with  fine  resolution  around  the 
metallurgical  p-n  junction. 

In  this  paper  we  describe  the  calculation  of  the 
in-cell  IGBT  breakdown  using  a  parallel  finite 
element  3D  solution  of  the  Poisson  equation  on  an 
array  of  processors.  The  3D  discretisation  is  based 
on  a  topologically  rectangular  finite  element  (FE) 
grid  which  facilitates  the  partitioning  of  the 
domain  over  arrays  of  processors.  Results  for  the 
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FIGURE  1  Schematic  view  of  the  top  of  an  octagonal  cell 
IGBT. 


dependence  of  the  breakdown  voltage  on  the 
device  geometry  and  the  doping  profile  are 
presented. 


PARALLEL  IMPLEMENTATION 

The  Poisson  solver  used  to  study  the  in-cell  IGBT 
breakdown  is  part  of  a  general  purpose  device 
simulator  in  development  [4].  This  simulator  is 
designed  for  parallel  MIMD  architectures  based 
on  a  2D  array  of  mesh  connected  processors.  The 
current  version  runs  on  an  8  node  PowerPC 
Parsytec  X-plorer  under  Parix.  This  provides  the 
necessary  computing  power  for  computationally 
intensive  3D  simulation. 

The  parallel  implementation  is  based  on  the 
spatial  domain  decomposition  approach,  whereby 
the  solution  domain  is  partitioned  across  the  2D 
array  of  processors.  Each  processor  stores  infor¬ 
mation  and  updates  the  solution  only  for  the  nodes 
which  belong  to  its  subdomain.  The  partitioning  is 
greatly  simplified  by  the  use  of  a  topologically 
rectangular  grid  which  is  described  later.  Both  a 
parallel  4-colour  block  Newton  SOR  solver,  and  a 
parallel  BiCGSTAB  solver  have  been  developed 


and  used  for  the  solving  the  Poisson  equation. 
While  the  SOR  solver  is  efficient  and  works  well  at 
low  applied  voltage,  convergence  problems  at  the 
high  voltages  require  for  breakdown  calculations 
were  encountered.  The  BiCGSTAB  solver  in 
combination  with  Newton  iterations,  and  appro¬ 
priate  damping,  works  well  in  the  high  voltage 
range. 

Due  to  the  large  problem  size,  there  is  not 
sufficient  memory  to  run  the  program  on  an 
individual  processor  of  the  Parsytec  system  and 
to  have  a  proper  speed-up  assesment.  However  the 
speed-up  obtained  by  running  on  the  8x90  MHz 
PPC601  Parsytec  system,  compared  to  a  single 
170  MHz  UltraSparc  Workstation  is  approxi¬ 
mately  2.25. 


GRID  GENERATION 

In  order  to  obtain  the  required  accuracy  in  the 
solution  of  the  Poisson  equation  it  is  essential  to 
have  a  finite  element  grid  which  conforms  to  the 
shape  of  the  metallurgical  p-n  junctions  and  to  the 
cell  topology.  The  use  of  unstructured  finite  element 
grids  can  present  serious  problems  when  the  parallel 
implementation  is  based  on  domain  decomposition, 
and  can  lead  to  very  low  efficiency  of  the  parallel 
code.  To  avoid  these  difficulties  we  have  developed 
a  topologically  rectangular  FE  grid  [5]. 

The  structured  topologically  rectangular  grid 
retains  the  number  of  grid  nodes  in  any  particular 
index  direction.  This  makes  the  partitioning  of  the 
grid  over  an  array  of  mesh  connected  processors  a 
straight  forward  procedure.  The  partitioning  is 
done  by  dividing  the  number  of  grid  nodes  in  each 
index  direction  by  the  number  of  processors 
available  in  the  processor  array  in  the  same 
direction  [6].  For  such  partitioning  only  nearest- 
neighbour  communications  are  required  in  the 
design  of  iterative  solvers,  because  adjacent  nodes 
in  adjacent  partition  subdomains  appear  on 
neighbouring  processors. 

We  have  implemented  two  different  approaches 
for  the  generation  of  topologically  rectangular 
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grids  in  the  discretisation  of  the  IGBT  cell  (Fig.  2). 
The  most  straightforward  method  is  to  deform  a 
Cartesian  grid  to  fit  key  contours  in  the  device 
structure,  such  as  the  p-n  junctions.  This  method 
was  used  in  the  generation  of  the  grid  in  the 
vertical  walls  of  the  IGBT  cell.  The  deformation 
algorithm  has  been  designed  to  be  as  general  as 
possible  to  allow  the  use  of  arbitrarily  shaped 
defining  contours  described  in  a  parameterised 
form.  This  allows  the  generation  of  complex  grids. 
A  different  method  was  used  in  the  design  of  the 
grid  for  the  top  plane  of  the  IGBT.  This  method 
again  uses  key  contours  such  as  the  emitter  contact 
edge  and  surface  p-n  junctions.  These  contours  are 
used  to  generated  additional  guiding  contours,  the 
spacing  of  which  controls  the  grid  density.  The 
correct  number  of  nodes  are  placed  on  each  gui¬ 
ding  contour  to  satisfy  the  criteria  for  a  topologi¬ 
cally  rectangular  grid.  This  method  is  more 
flexible,  but  has  the  restriction  that  the  grid  must 
have  equal  numbers  of  nodes  in  each  direction. 

The  full  3D  grid  is  generated  by  extending  the 
grid  for  the  top  surface  vertically,  and  then 


FIGURE  2  Topologically  rectangular  3D  grid  showing  top 
and  sides  of  the  3D  grids. 


deforming  this  3D  grid  by  the  deformations 
calculated  for  the  side  walls  of  the  cell.  Figure  3 
shows  the  full  3D  grid  for  the  region  down  to  the 
p-n~  junction,  clearly  illustrating  the  internal 
curvature  of  the  grid,  and  the  shape  of  the  p-n 
junction  itself. 


BREAKDOWN  CALCULATION 
AND  RESULTS 

The  calculation  of  breakdown  voltage  is  based  on 
the  evaluation  of  the  ionisation  integrals  for  holes 
and  electrons.  From  the  solution  of  the  Poisson 
equation  the  electric  field  in  each  element  is 
calculated,  and  the  point  of  maximum  electric 
field  strength  is  determined.  The  integration  path 
for  the  ionisation  integrals  passes  through  this 
point  and  is  traced  along  the  electric  field  vector  in 
both  directions  until  either  a  boundary  is  encoun¬ 
tered,  or  the  electric  field  strength  becomes 
negligible.  The  ionisation  integrals  along  this  path 
are  then  calculated.  The  hole  ionisation  integral 
was  found  to  be  more  sensitive  to  changes  in 
applied  voltage  and  was  used  in  the  procedure  for 
tracing  the  breakdown  voltage.  However  we  check 
that  both  hole  and  electron  integrals  become  unity 
at  the  breakdown  voltage. 

The  point  of  highest  electric  field  occurs  near  the 
corner  of  the  p-n~  junction  which  corresponds  to 


FIGURE  3  The  grid  at  the  metallurgical  p-n  junction 
showing  junction  curvature. 
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the  corner  of  the  octagonal  emitter  contact.  This  is 
the  region  where  the  junction  exhibits  the  highest 
curvature.  The  effect  of  the  doping  in  the  yT  base 
region  on  the  breakdown  voltage  is  shown  in 
Figure  4.  The  in-cell  breakdown  is  compared  to 
the  breakdown  voltage  for  a  one-sided  planar 
junction.  As  expected,  the  increasing  doping 
concentration  of  the  base  region  reduces  the 
breakdown  voltage  significantly.  Although  the 
smaller  inter-cell  distance  and  the  gate  potential 
significantly  reduce  the  curvature  of  the  depletion 
layer  edge,  and  the  strength  of  the  fringing  field, 
the  IGBT  cell  shows  a  noticeable  reduction  in 
breakdown  voltage  compared  to  the  planar  junc¬ 
tion. 

The  introduction  of  a  surface  stopper  implanta¬ 
tion  reduces  the  channel  length,  prevents  the 
pinch-off  of  the  vertical  JFET,  and  lowers  the 
on-state  voltage  drop.  The  stopper  concentration, 
however,  affects  the  in-cell  breakdown.  Figure  5 
shows  the  effect  that  increasing  the  surface  con¬ 
centration  of  the  implantation  has  on  the  break¬ 
down  voltage  of  a  typical  IGBT  rated  at  600  V. 
For  a  surface  concentration  above  1016cm“3  the 
breakdown  voltage  falls  sharply  and  the  stopper 
concentration  limits  the  blocking  capabilities  of 
the  device. 


FIGURE  4  Breakdown  voltage  vs  n  base  region  doping 
concentration. 


Stopper  surface  concentration  (cm-3) 


FIGURE  5  Breakdown  voltage  vs  stopper  implant  surface 
concentration. 


CONCLUSIONS 

In  this  paper  we  have  described  3D  parallel  finite 
element  simulation  of  in-cell  breakdown  voltages 
in  cellular  IGBTs.  The  simulator  is  based  on  a 
topologically  rectangular  grid  which  reflects  the 
geometry  of  the  device,  and  simplifies  the  parti¬ 
tioning  of  the  solution  domain  in  the  domain 
decomposition  based  parallel  implementation.  The 
results  of  the  breakdown  calculations  illustrate  the 
importance  of  realistic  3D  simulation  in  the  design 
of  cellular  IGBTs. 
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Multidimensional  Schrodinger-Poisson  Equations 

A.  TRELLAKIS1 * 3  *,  A.  T.  GALICKa  ,  A.  PACELLIb  and  U.  RAVAIOLI3 


a  Beckman  Institute,  University  of  Illinois  at  Urbana-Champaign,  Urbana,  IL  61801; 
b  Dipartimento  di  Elettronica  e  Informazione  and  CEQSE-CNR,  Politecnico  di  Milano, 
Piazza  Leonardo  da  Vinci  32,  20133  Milano,  Italy 


We  present  a  fast  and  robust  iterative  method  for  obtaining  self-consistent  solutions  to  the 
coupled  system  of  Schrodinger’s  and  Poisson’s  equations  in  quantum  structures.  A  simple 
expression  describing  the  dependence  of  the  quantum  electron  density  on  the  electrostatic 
potential  is  used  to  implement  a  predictor  -  corrector  type  iteration  scheme  for  the  solution 
of  the  coupled  system  of  differential  equations.  This  approach  simplifies  the  software 
implementation  of  the  nonlinear  problem,  and  provides  excellent  convergence  speed  and 
stability.  We  demonstrate  the  algorithm  by  presenting  an  example  for  the  calculation  of  the 
two-dimensional  bound  electron  states  within  the  cross-section  of  a  GaAs-AlGaAs  based 
quantum  wire.  For  this  example,  six  times  fewer  iterations  are  needed  when  our  predictor  - 
corrector  approach  is  applied,  compared  to  a  corresponding  underrelaxation  algorithm. 


Keywords :  Semiconductor,  simulation,  quantum  wire,  iteration,  predictor -corrector,  Schrodin¬ 
ger-Poisson 


1  INTRODUCTION 

As  electronic  device  dimensions  approach  nano¬ 
meter  scale,  quantum  effects  are  expected  to 

dominate  their  electronic  properties.  Consequently 
there  is  considerable  interest  in  the  efficient 
numerical  simulation  of  such  structures,  not  only 
for  exploring  novel  device  architectures  but  also 
for  maintaining  reliability  for  reduced  present-day 
devices.  In  this  paper  we  present  a  fast  and  robust 
iterative  method  for  solving  the  two-dimensional 


Schrodinger-Poisson  equations  and  obtaining  the 
electron  states  in  the  cross-section  of  a  quantum 
wire,  and  we  compare  the  efficiency  of  our  method 
with  the  standard  underrelaxation  algorithm. 

The  physical  model  used  to  describe  the 
quantum  wire  consists,  in  the  effective  mass 
approximation,  of  Schrodinger’s  equation 


4-  [Vh  —  e<j)  +  Vxc(n)  —  Ei\il)i  =  0 


(i) 
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coupled  with  a  nonlinear  Poisson  equation 

V(eV0)  =  -e{-n+p  +  N£-  NX),  (2) 

where  the  unknowns  are  the  wavefunctions  0/ 
belonging  to  energy  levels  Eh  and  the  electrostatic 
potential  0.  Here,  m*  is  the  electron  effective  mass, 
e  the  dielectric  constant,  n  and  p  are  the  electron 
and  hole  concentrations,  N £  and  N ^  the  ionized 
donor  and  acceptor  concentrations,  Vh  the  hetero¬ 
junction  step  potential,  and  finally  Vxc  the 
exchange  correlation  potential  in  the  local  density 
approximation. 

The  quantum  electron  density  n  is  obtained 
from  the  eigenpairs  (Eh  0/)  of  Schrodinger’s 
equation  as 

«  =  5>,|W  I2,  (3) 

where 


denotes  the  occupancy  of  the  /- th  eigenstate,  and  T 
is  the  Temperature,  kB  Boltzmann’s  constant,  EF 
the  Fermi  level,  and  JL\/2  the  complete  Fermi- 
Dirac  integral  of  order  -1/2. 

We  avoid  the  expense  of  solving  Schrodinger’s 
equation  in  a  very  large  grid  by  using  the  quantum 
formulation  for  n  (3)  only  within  the  region  of  the 
quantum  wire  itself,  while  in  the  rest  of  the 
simulation  domain,  where  the  electron  density  is 
small,  we  replace  it  by  its  classical  expression.  The 
hole  concentration  p  is  very  small  throughout  the 
quantum  wire,  but  needs  to  be  included  into 
Poisson’s  equation  to  ensure  a  correct  description 
of  the  band  bending  in  the  substrate  [1]. 


2  SOLUTION  BY  UNDERRELAXATION 

This  system  of  coupled  nonlinear  differential 
equations  is  usually  solved  by  an  iteration  between 
Poisson’s  and  Schrodinger’s  equation  [2,  3].  Since 
a  plain  iteration  by  itself  does  not  converge,  it  is 


necessary  to  underrelax  in  the  electron  density  n 
using  an  adaptively  determined  relaxation  para¬ 
meter  Jh\  This  underrelaxation  approach  can  be 
briefly  outlined  as  follows: 

1 .  Solve  nonlinear  Poisson  equation  using  the  old 

electron  density  to  obtain  electrostatic 

potential  <jp 

2.  Solve  Schrodinger’s  equation  using  and 
kxc(«(A:_1)  to  obtain  a  new  set  of  eigenpairs 
(Ef\ # ). 

3.  Calculate  an  intermediate  electron  density  n^. 

4.  Choose  an  appropriate  relaxation  parameter 

to  obtain  new  density  Pk\  and  repeat  outer 
iteration  until  n  becomes  stationary. 

The  weakness  of  this  method  is  the  inherent 
instability  of  the  outer  iteration  which  is  controlled 
by  the  underrelaxation  procedure  only.  The 
relaxation  parameter  is  not  known  in  advance 
and  needs  to  be  dynamically  readjusted  during  the 
course  of  the  iteration.  If  Pk)  is  too  large,  the  total 
quantized  charge  Jndx  oscillates  from  one  itera¬ 
tion  step  to  the  other  without  reaching  conver¬ 
gence,  while  if  (J k)  is  too  small,  too  many  iteration 
steps  are  necessary  to  achieve  convergence. 


3  SOLUTION  BY  A  PREDICTOR - 
CORRECTOR  TYPE  APPROACH 

To  address  this  problem  we  have  to  modify  the 
underrelaxation  algorithm  in  a  way  that  partially 
decouples  both  partial  differential  equations  and 
damps  the  oscillations  in  the  total  electric  charge. 
We  can  achieve  this  goal  by  incorporating  a 
modified  expression  for  the  quantum  electron 
density  «(0)  into  Poisson’s  equation,  which 
approximates  the  implicit  dependency  of  the 
quantum  electron  density  n  on  the  electrostatic 
potential  0  due  to  Schrodinger’s  equation. 

A  suitable  expression  of  this  type  is  provided  by 
quantum  mechanical  perturbation  theory.  Using 
its  formalism  one  can  show  [4,  5],  that  for  a  small 
perturbation  of  the  electrostatic  potential  6<j>  the 
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occupancies  Nt  within  the  electron  density  (3) 
become  ^-dependent  as 

N,m  «  N,(6<t>)  (5) 

=  (O 

In  our  predictor— corrector  approach  we  utilize 
this  result  by  applying  it  to  arbitrary  large  changes 
in  </>: 

We  start  the  £>th  iteration  step  by  solving  a 
modified  Poisson  equation 

V(eV#>)  -  e[h{^) -P-N++NI),  (7) 

where  the  potential-independent  quantum  electron 
density  n  is  replaced  by  the  potential  dependent 
predictor  h, 

h{4k))  =  -  ^_1)) (8) 

with  occupancies  Nf  ^  defined  as  in  (6)  and  the 
superscript  k—  1  indicating  the  use  of  Ef  The 
electrostatic  potential  <j>(k)  obtained  from  solving 
Poisson’s  equation  (7)  together  with  the  predicted 
value  for  the  electron  density  n(<pW)  is  then  used 
within  Schrodinger’s  equation  (corrector) 

-%v{\  V^(t)N)  +{Vh-  e<t>W 

1  \m  /  (9) 

+Vxc(n^)-E^f)=0 

to  calculate  a  corrected  update  n(k)  to  the  exact 
electron  density 

=  Y'N(ik)\ipf)\2.  (io) 

l 

It  is  essential  to  insert  the  latest  predictor  value  n{k) 
into  the  exchange  correlation  in  Schrodinger’s 
equation  (9),  since  the  numerical  experiment  shows 
that  the  use  of  n(fc-1)  within  Vxc  is  detrimental  to 


convergence.  The  use  of  n  k  also  follows  the  rule 
that  the  latest  predictor  value  should  enter  the 
corrector. 

The  numerical  experiment  also  shows,  that, 
unlike  in  the  standard  algorithm,  convergence  is 
achieved  without  underrelaxation,  so  the  outer 
iteration  can  be  reduced  to  a  simple  alternation 
between  Poisson’s  and  Schrodinger’s  equation 
until  the  quantum  electron  density  n  becomes 
stationary: 

Residual(n)  =  || —  n^_^||2  <  £•  (11) 


4  NUMERICAL  RESULTS 

We  compared  the  efficiency  of  both  iteration 
schemes  by  computing  the  bound  states  for  the 
model  device  shown  in  Figure  1  [2].  The  quantum 
wire  is  based  on  a  GaAs-AlGaAs  modulation 
doped  structure,  and  delimited  by  two  metal  gates 
with  a  distance  of  400  nm.  The  gate  voltages  were 
chosen  as  VG=  1.3  V  with  respect  to  the  substrate 
for  both  contacts;  additionally  we  assumed  a 
Schottky  barrier  of  approximately  1  eV  between 
metal  and  semiconductor  material.  The  A1  con¬ 
centration  was  set  equal  to  26%  throughout  the 
AlGaAs  layer,  and  we  also  included  a  fixed  surface 
charge  density  of  1.6xl012  cm-2  at  the  GaAs-air 
interface.  All  calculations  were  done  at  a  tempera¬ 
ture  of  T=  4.2  K. 

Both  Schrodinger’s  equation  as  well  as  Poisson’s 
equation  are  discretized  by  a  box  integration  finite 
difference  method  to  take  material  discontinuities 
into  account.  Since  the  quantum  wire  covers  only  a 
small  part  of  the  entire  simulation  domain,  we 
employ  a  non-uniform  rectangular  mesh  concen¬ 
trated  around  the  wire  region  to  minimize 
computational  cost  while  retaining  high  accuracy 
within  the  region  of  interest.  For  the  model  device 
discussed  above  a  99  x  88  grid  was  found  to  yield 
satisfactory  spatial  resolution  and  short  program 
run-times  on  HP100  workstations. 

After  discretization,  Schrodinger’s  equation 
becomes  a  sparse  eigenvalue  problem,  which  is 
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FIGURE  1  The  model  device  structure  used  in  the  calculations. 


solved  by  Chebyshev-Arnoldi  iteration  [6].  This 
method  allows  us  to  compute  only  the  physically 
relevant  lowest-energy  states.  The  nonlinear  Pois¬ 
son  equation  is  solved  by  Newton-Raphson 
iteration  with  inexact  line  search.  The  necessary 
solution  of  a  sparse  linear  system  at  each  iteration 
step  is  accomplished  by  a  version  of  the  precondi¬ 
tioned  conjugate  gradient  method  involving  red- 
black  reordering.  The  Fermi-Dirac  integrals, 
which  need  to  be  evaluated  repeatedly  at  each 
grid  point,  are  computed  using  rational  function 
approximations,  which  combine  speed  of  execu¬ 
tion  with  high  accuracy  [4,  7]. 

We  compare  the  convergence  speed  of  the 
predictor-corrector  method  with  a  fast  adaptive 
underrelaxation  scheme  (an  adaptive  nonlinear 
version  of  the  standard  Gauss-Seidel  algorithm) 
developed  by  one  of  the  authors  [6],  which  uses  a 
heuristic  method  to  adjust  the  relaxation  para¬ 
meter  oj(k\  Using  the  residual  in  the  quantum 
electron  density  (11)  as  measure  to  quantify 
convergence  we  find  that  our  predictor  -  corrector 
approach  makes  the  outer  iteration  stable.  In  fact, 
the  residual  decreases  by  close  to  one  order  of 
magnitude  each  step  (Fig.  2),  which  is  a  6-fold 
increase  in  convergence  speed  compared  to  under¬ 
relaxation.  Additionally  we  find  that  the  residual 
decreases  uniformly  from  one  step  to  the  next, 


FIGURE  2  Residual  of  the  quantum  electron  density  using  a 
99x88  grid. 


while  for  the  heuristic  underrelaxation  algorithm 
occasional  increases  are  possible. 

The  inclusion  of  exchange  correlation  Vxc  results 
in  an  additional  slowdown  of  convergence  for  the 
underrelaxation  scheme  and  also  decreases  its 
stability.  However,  our  predictor -corrector  meth¬ 
od  remains  stable  and  efficient,  if  the  last  available 
predictor  value  is  inserted  into  Vxc  as 

outlined  above. 

We  verified  the  rapid  uniform  convergence  of 
our  predictor -corrector  method  for  a  wide  range 
of  temperatures,  gate  voltages  and  geometries,  and 
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found  it  in  all  cases  clearly  superior  to  adaptive 
underrelaxation.  This  result  indicates  a  fairly  wide 
range  of  applicability,  and  considering  the  ease  of 
implementation  of  our  method  we  strongly  re¬ 
commend  its  use  even  for  existing  codes. 
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We  present  systematic  theoretical  Cellular  Automata  (CA)  studies  of  a  novel  nanometer 
scale  Si  device,  namely  vertically  grown  Metal  Oxide  Field  Effect  Transistors 
(MOSFET)  with  channel  lengths  between  65  and  120  nm.  The  CA  simulations  predict 
drain  characteristics  and  output  conductance  as  a  function  of  gate  length.  The  excellent 
agreement  with  available  experimental  data  indicates  a  high  quality  oxide/semiconduc- 
tor  interface.  Impact  ionization  is  shown  to  be  of  minor  importance.  For 
inhomogeneous  /^-doping  profiles  along  the  channel,  significantly  improved  drain 
current  saturation  is  predicted. 


Keywords:  Cellular  automaton,  submicron  MOSFET,  Silicon,  simulation,  semiconductor  device 


1  INTRODUCTION 

As  a  numerically  very  efficient  discrete  variant  of 
the  Monte  Carlo  (MC)  technique  [1],  the  CA  [2] 
method  significantly  reduces  the  gap  in  computa¬ 
tional  speed  between  simulation  techniques  based 
on  solving  the  full  Boltzmann  equation,  and 
moment-based  approaches  such  as  the  hydrody¬ 
namic  method.  The  CA  method  is  therefore 
particularly  suited  to  simulate  highly  nonlinear 
charge  transport,  typical  for  the  ultra-short  devices 
described  in  this  paper. 

2  VERTICAL  MOSFET 

Molecular  Beam  Epitaxy  (MBE)  and  Chemical 
Vapor  Deposition  (CVD)  allow  the  realization  of 


vertical  devices  with  characteristic  channel  lengths 
well  below  0.1pm.  Recently,  CVD  epitaxy  was 
utilized  to  grow  vertical  MOSFET’s  with  ultra 
short  channels  (see  [3,4]).  The  epitaxially  grown 
layer  structures  are  selectively  etched  to  a  depth  of 
0.8  pm  and  thermally  oxidized  to  grow  the  Si02 
gate  dielectric  with  a  thickness  of  5  nm.  The 
quality  of  the  oxide  is  comparable  to  planar 
structures.  Figure  1  shows  the  cross  section  of 
such  a  vertical  MOS  [4].  The  simulated  geometry  is 
shown  on  the  left  side  of  the  figure,  and  corres¬ 
ponds  to  one  half  of  the  real  device. 

Devices  with  different  doping  levels  and  channel 
length  were  fabricated.  The  doping  concentra¬ 
tion  in  the  source  and  drain  n  +  regions  are  in  the 
range  1018cm-3~2  x  1019cm“3,  while  the  ^-buffer 
layer  contains  an  acceptor  concentration  of 
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2x  1018cm-3.  These  devices  were  grown  with  an 
effective  channel  length  varying  from  65  to  170  nm. 

To  investigate  the  influence  of  nonlinear  trans¬ 
port  on  the  electrical  characteristics  of  such  ultra 
short  devices,  we  performed  systematic  calcula¬ 
tions  for  all  the  realized  structures  and  compare 
our  results  to  experimental  data.  We  further 
explore  the  influence  of  inhomogeneous  ^-doping 
along  the  channel  in  improving  the  drain  current 
saturation  behavior. 


2.1  Current- Voltage  Curves 

As  a  critical  test  for  the  nominal  geometry  settings 
like  oxide  thickness  and  /^-doping  level,  we  have 
compared  the  experimental  and  calculated  sub¬ 
threshold  behavior  of  the  devices  described  above. 
Excellent  agreement  was  obtained  for  several 
devices  with  different  geometries  and  doping 
concentrations,  which  indicates  that  the  nominal 
and  technologically  realized  values  of  the  geometry 
agree  well  with  each  other.  This  is  exemplified  in 
Figure  2  for  the  65  nm  CVD  vertical  MOSFET 
structure.  There,  the  applied  drain  voltage  is  0.5  V. 

Based  on  these  geometries,  we  investigated  the 
drain  current  saturation  behavior  of  the  vertical 
MOSFET  for  different  channel  lengths.  Figure  3 
shows  the  output  current  versus  drain  voltage  for 
three  different  channel  lengths  of  170  nm,  120  nm 
and  65  nm,  respectively.  The  bias  of  the  gate  was 
fixed  to  the  value  of  2  V.  Clear  drain  current 
saturation  behavior  is  found  even  for  the  ultra- 
short,  65  nm  device.  The  increasing  output  current 


FIGURE  1  Layout  of  the  vertical  CVD-EPI  MOS.  The 
growth  direction  is  shown,  while  the  equivalent  simulated 
region  is  on  the  right  side. 


Vgate(V) 

FIGURE  2  Sub-threshold  output  current  of  the  simulated 
device  versus  gate  voltage  when  a  low  (FD  =  0.5  V)  drain  bias  is 
applied. 


FIGURE  3  Computed  (points)  and  experimental  (full  lines) 
current-voltage  characteristics  with  decreasing  channel  length 
at  VG  -  2  V  gate  bias. 


and  output  conductance  with  decreasing  channel 
length  reflect  a  strong  barrier  reduction  at  the 
source  junction.  Nevertheless,  even  for  high  bias 
condition,  no  parasitic  charge  transport  occurs  in 
the  bulk  parallel  to  the  conduction  channel.  The 
increase  of  drain  current  and  output  conductance 
due  to  the  electrostatic  effects  mentioned  above  are 
further  enhanced  by  velocity  overshoot  in  the 
shorter  channels.  This  velocity  overshoot  is  driven 
by  a  high,  inhomogeneous  electric  field  along  the 
channel.  For  the  65  nm  n-p-n  +  structure  biased  at 
VD  -  VG  =  2  V,  we  find  a  peak  value  of  800  kV/cm 
at  the  n  +p  drain  junction. 
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For  this  bias  condition,  the  average  electron 
velocity  in  the  conduction  channel  of  the  65  nm 
device  is  shown  in  Figure  4.  Pronounced  velocity 
overshoot  is  evident  all  along  the  conduction 
channel,  the  maximum  velocity  being  reached  at 
the  drain  end  of  the  /7-buffer,  where  it  peaks  to  a 
value  of  three  times  the  saturation  velocity  in 
homogeneous  Si. 

The  overall  agreement  for  the  complete  current - 
voltage  characteristics  between  the  CA  results  and 
experimental  data  is  exemplified  for  the  70  nm 
device  in  Figure  5.  For  this  device,  simulations  were 
performed  up  to  a  bias  value  of  FD  =  2.5  V,  to 
investigate  the  occurrence  of  impact  ionization  at 
higher  bias.  Up  to  the  highest  bias  point  shown, 
negligible,  contributions  due  to  impact  ionization 
were  found.  However,  this  process  becomes  im¬ 
portant  at  higher  drain  bias.  Analogous  results 
were  found  for  the  65  nm  device. 


2.2  Bulk  Effects 

The  potential  barriers  due  to  the  np  (source)  and 
pn  (drain)  junctions  prevent  a  parasitic  bulk 
current  from  flowing  parallel  to  the  channel 
inversion  layer.  At  the  same  time,  the  electric  field 
due  to  these  junctions  can  reach  high  values,  giving 
rise  to  impact  ionization  or  even  field  breakdown. 
Optimization  of  such  devices  must  account  for  the 
electrostatic  behavior  of  the  bulk  region  in  order 
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FIGURE  5  Computed  (points)  and  experimental  (full  lines) 
current -voltage  characteristics  of  the  70  nm  vertical  MOS- 
FET. 


to  improve  saturation  and  avoid  impact  ionization 
at  a  given  operating  bias.  Figure  6  (left)  shows  the 
concentration  of  carriers  in  the  bulk  65  nm  n+pn 
system  when  a  bias  of  2  V  is  applied  between  the  n 
and  n +  regions  corresponding  to  the  source  and 
drain  of  the  device  shown  in  Figure  1.  The  fact 
that  the  potential  barrier  due  to  the  npn  junction 
(Fig.  6)  is  still  intact  at  this  drain  bias  ensures  that 
no  parasitic  bulk  current  is  flowing. 


2.3  Design  Optimization 

In  order  to  improve  the  saturation  behavior  of  the 
vertical  MOSFET,  an  additional  20  nm  p  +  -buffer 
was  introduced  in  the  central  /7-region.  The  chosen 
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FIGURE  4  Average  drift-velocity  of  electrons  in  the  C-VD- 
EPI  MOSFET  of  Figure  1 ;  a  pronounced  velocity  overshoot  is 
evident  in  the  p-buffer. 


FIGURE  6  Carrier  concentration  (left)  and  potential  profile 
(right)  in  the  bulk  n-p-n  structure  present  in  the  vertical  65  nm 
MOSFET. 
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doping  concentration  in  this  p  +  -buffer  is 
3xl018cm-3  .  The  buffer  was  placed  lOnm  from 
the  n-p  source  junction.  The  buffer  extension  and 
position  were  optimized  to  reduce  the  influence  of 
the  drain  potential  on  the  output  current,  thus 
improving  the  saturation  behavior.  Its  separation 
from  the  n-p  source  junction  keeps  the  value  of  the 
junction  field  low  enough  to  avoid  impact  ioniza¬ 
tion.  Additionally,  punch-through  action  is  reduced 
due  to  the  higher  doping,  which  is  crucial  for  devices 
with  shorter  channels.  The  drain  current  reduction 
due  to  the  presence  of  the  buffer  was  compensated 
by  reducing  the  oxide  thickness  to  4  nm. 

Simulation  results  for  the  device  with  the  p 4'- 
buffer  are  compared  in  Figure  8  with  measure¬ 
ments  made  on  a  transistor  with  a  homogeneously 
doped  channel.  The  saturation  behavior  is  re¬ 
markably  improved  compared  to  the  character¬ 
istics  shwon  in  Figure  5,  and  the  reduced  oxide 
thickness  completely  compensates  the  current 
reduction  due  to  the  p  +  -buffer. 

3  CONCLUSIONS 


Vdrain  (V) 

FIGURE  8  Improved  saturation  behavior  of  the  simulated 
MOSFET  after  adding  an  additional  20  nm  p+  buffer.  The 
simulation  results  (full  lines)  are  compared  with  experimental 
data  of  Figure  5. 

excellent  agreement  with  the  experiments.  The 
crucial  role  of  the  bulk  parasitic  conduction  was 
stressed,  and  an  optimized  device  design  was 
proposed  which  improves  the  overall  performance 
of  the  device. 


The  capability  of  the  CA  approach  to  accurately 
predict  highly  nonlinear  transport  behavior  and 
the  resulting  electrical  characteristics  in  real 
nanostructured  semiconductor  devices  was  de¬ 
monstrated.  A  novel  family  of  vertical  MOSFET’s 
was  investigated,  giving  results  which  are  in 


m* — H*“ 

10  20  40 

FIGURE  7  Cross-section  of  the  saturation  optimizing  geo¬ 
metry.  A  3  xl018cm”3  /?+-buffer  is  included  within  the  central 
p  region. 
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X3D  Moving  Grid  Methods  for  Semiconductor 

Applications* 
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The  Los  Alamos  3D  grid  toolbox  handles  grid  maintenance  chores  and  provides  access 
to  a  sophisticated  set  of  optimization  algorithms  for  unstructured  grids.  The  application 
of  these  tools  to  semiconductor  problems  is  illustrated  in  three  examples:  grain  growth, 
topographic  deposition  and  electrostatics.  These  examples  demonstrate  adaptive 
smoothing,  front  tracking,  and  automatic,  adaptive  refinement/derefinement. 


Keywords:  Adaptive  mesh  smoothing,  multimaterial  grids,  moving  grids,  moving  surfaces, 
unstructured  grids,  moving  finite  elements 


INTRODUCTION 

3D  grain  growth  modeling  and  3D  topographic 
simulation  have  in  common  the  requirement  of 
accurately  representing  time  dependent  surface 
motion  in  a  3D  volume.  Problems  involving  fixed 
surfaces  such  as  parasitic  parameter  extraction  for 
interconnect  modeling  can  benefit  from  adaptive 
grid  methods  because  they  substantially  reduce 
solution  error  from  iteration  to  iteration.  The  Los 
Alamos  X3D  grid  toolbox  provides  a  set  of 
capabilities  including  initial  grid  generation  of 
complex  multimaterial  geometries  and  grid  opti¬ 
mization  that  preserves  material  interfaces.  X3D 


data  structures  and  toolbox  methods  are  designed 
as  objects  in  order  to  be  user  accessible  and 
extensible.  X3D  commands  can  be  issued  from 
within  an  application  driver  program,  and  the 
example  applications  use  this  feature  to  perform, 
as  needed,  grid  reconnection,  node  merging  and 
smoothing.  Additionally,  all  X3D  data  structures 
are  available  to  the  application  driver  via  calls  to 
utility  routines.  Thus  when  an  application  detects 
a  non-routine  event  such  as  a  topological  change 
in  a  material  region  under  deformation,  a  special 
purpose  user  routine  can  easily  be  incorporated 
into  the  system.  These  design  features  promote  the 
separation  of  the  physical  based  simulation  from 
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the  grid  maintenance  chores,  but  allow  full  access 
to  the  grid  data  structures  when  required. 

Gradient  Weighted  Moving  Finite  Elements 
Applied  to  Metallic  Grain  Growth 

In  our  first  application  involving  the  X3D  toolbox 
[1],  we  use  Gradient  Weighted  Moving  Finite 
Elements  [2]  (GWMFE)  to  move  a  multiply- 
connected  network  of  triangles  to  model  the 
annealing  of  3-D  metallic  grains.  We  assume 
evolution  of  grain  interfaces  obeys  the  simple 
equation 

Vn  =  t^K, 

where  vn  is  the  normal  velocity  of  the  interface,  \x  is 
the  mobility,  and  K  is  the  local  mean  curvature  [3]. 
Gradient  Weighted  Moving  Finite  Elements  mini¬ 
mizes 

J(vn-/2K)2dS 

over  all  possible  velocities  of  the  interface  vertices. 
(The  integral  is  over  the  surface  area  of  the 
interfaces.)  This  leads  to  system  of  3 N  ODE’s: 

C(y)y  =  g(y), 

where  y  is  the  3V-vector  containing  the  x,  y ,  and  z 
coordinates  of  all  N  interface  vertices,  C(y)  is  the 
matrix  of  inner  products  of  finite  element  basis 
functions,  and  g(y)  is  the  right-hand  side  of  inner 
products  involving  surface  curvature.  The  ODE’s 
are  integrated  using  an  implicit  variable  time  step 
integrator. 

As  an  example,  we  evolve  a  5  grain  micro¬ 
structure  in  the  confined  geometry  of  an  aluminum 
interconnect  on  a  semiconductor  chip,  and  in 
Figure  1  we  show  the  smooth  surfaces  of  the  grains 
at  an  intermediate  time  in  their  evolution  under 
mean  curvature.  (The  initial  state  for  the  time 
evolution  was  obtained  using  Monte  Carlo  anneal¬ 
ing  of  a  discrete  effective  model  on  a  fixed  lattice 
[4]  and  is  not  shown.)  Visible  in  this  exploded  view 


FIGURE  1  5  grain  microstructure  evolved  under  mean 

curvature  motion  using  GWMFE. 


FIGURE  2  Evolved  5  grain  microstructure  showing  surface 
grid.  Evolution  has  caused  imbalance  in  grid  density. 


are  triple  points  on  the  surface  of  the  interconnect, 
as  well  as  a  triple  line  and  tetrahedral  point  in  the 
interior.  The  corresponding  surface  grid  is  shown 
in  Figure  2.  It  is  clear  that  for  this  simulation  to 
continue,  some  of  triangles  must  be  annihilated  to 
prepare  for  topological  events  such  as  the  “pinch¬ 
ing  off”  of  a  grain.  That  is,  a  front  tracking 
simulation  must  involve  a  nontrivial  amount  of 
grid  manipulation  in  order  to  successfully  run  to 
completion.  In  Figure  3,  we  show  the  effect  of 
massage  which  is  a  grid  manipulation  command  in 
the  X3D  toolbox.  As  seen  in  the  figure,  the 
massage  command  has  derefined  the  unstructured 
mesh  without  significantly  damaging  the  shapes  of 
the  grains.  Indeed,  the  command  takes  as  user 
input  a  “damage”  tolerance  which  gives  the 
maximum  acceptable  amount  of  grain  shape 
deformation  allowable  in  the  derefinement  pro- 
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FIGURE  3  5  grain  evolved  grid  after  X3D  massage  com¬ 

mand.  The  grid  has  been  derefined  and  refined  according  to 
desired  edge  length  criteria  while  preserving  material  interfaces 
and  external  boundaries. 

cess.  Also  called  is  the  recon  command  which 
allows  changing  of  connectivity  of  the  associated 
tetrahedral  mesh  in  the  interior  of  the  volume.  Still 
a  work  in  progress,  we  anticipate  that  with  the 
availability  of  these  X3D  grid  manipulation  tools 
we  will  be  able  to  run  the  annealing  process  to 
steady-state.  These  microstructures  will  then  pro¬ 
vide  high  quality  three-dimensional  models  for 
electromigration  reliability  simulation. 

Finite  Volume  Electrostatic  Calculations 
on  a  Solution  Adapted  Mesh 

Problems  involving  fixed  surfaces  can  also  benefit 
from  adaptive  grids.  To  illustrate  this,  we  show  the 
effects  of  solution  adaptive  grid  generation  for 
calculation  of  electric  fields  in  nontrivial  geome¬ 
try  -  as  occurs  when  attempting  to  extract  para¬ 
sitic  parameters  for  interconnect  modeling.  Figure 
4  shows  the  interior  of  a  box  with  a  conical 
intrusion.  We  solve  Laplace’s  equation  on  this 
three-dimensional  domain  using  a  finite  volume 
solver.  (Only  the  surface  triangles  are  shown  in  the 
figure;  the  calculation  is  done  on  unshown  tetra- 
hedra  that  conform  to  the  surfaces  and  fill  the 
volume.)  As  is  well  known,  the  electric  field 
becomes  arbitrarily  large  near  the  tip  of  the  cone. 
Displayed  at  the  tip  of  the  cone  is  an  isosurface  for 
the  component  of  electric  field  aligned  with  the 


FIGURE  4  Surface  triangles  from  an  unadapted  grid  used  to 
solve  Laplace’s  equation  on  a  unit  cube  with  the  bottom  surface 
pierced  by  a  sharp  cone.  Displayed  at  the  tip  of  the  cone  is  an 
isosurface  of  the  component  of  the  electric  field  aligned  with  the 
axis  of  the  cone. 


FIGURE  5  Surface  triangles  after  error-dependent  grid 
adaption.  Note  improvement  in  electric  field  isosurface. 
Adaption  was  turned  off  on  the  surface  of  the  cone  to  prevent 
too  much  grid  from  disappearing  into  the  tip  of  the  cone  where 
the  electric  field  is  infinite. 

axis  of  the  cone.  As  can  be  seen,  this  isosurface  is 
nonsmooth  due  to  lack  of  resolution  in  this  area  of 
highest  solution  error.  In  Figure  5  we  show  the 
same  view  and  electric  field  component  isosurface 
for  the  solution  on  a  grid  that  has  been  adapted  to 
an  a  posteriori  error  estimate.  The  isosurface  is 
smoother,  indicating  that  adaptive  error-based 
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smoothing  has  successfully  attracted  the  solution 
elements  to  the  critical  area  near  the  tip  of  the 
cone.  (However,  to  prevent  the  entire  mesh  from 
disappearing  into  the  singularity  at  the  tip  of  the 
cone,  node  movement  on  the  surface  of  the  cone 
was  turned  off.) 

The  solution  adaption  was  accomplished  using 
the  X3D  smooth  command  which  adapts  a  mesh  to 
minimize  the  C?  norm  of  the  gradient  of  solution 
error,  as  calculated  using  an  a  posteriori  error 
estimate  based  on  estimated  second  derivatives  of 
the  solution  [5].  In  order  for  the  finite  volume 
method  to  work,  a  Delaunay  grid  is  required. 
Thus,  after  each  adaptive  smoothing  iteration,  the 
X3D  recon  command  is  called  to  adjust  the  mesh 
connectivity  to  restore  the  Delaunay  condition  on 
the  tetrahedral  mesh.  We  note  that  the  maximum 
electric  field  on  the  adapted  mesh  of  Figure  5  is 
approximately  ten  times  higher  than  that  on  the 
unadapted  mesh  of  Figure  4.  Thus,  without 
increasing  the  number  of  nodes  used,  the  solution 
adaption  using  X3D  toolbox  commands  was 
automatically  able  to  increase  the  quality  of  the 
solution  in  a  critical  area. 

3D  Topographic  Simulation 

In  simulating  topographic  etch  and  deposition, 
TopoSim3D  separates  the  chemical  and  physical 
processes  from  grid  generation  and  maintenance 
operations.  The  flux  calculation,  source  character¬ 
ization  and  chemical  reaction  mechanisms  have 
access  to  the  needed  X3D  data  structures  and  to  the 
X3D  geometry  services.  X3D  is  used  to  build  the 
initial  3D  tetrahedral  mesh  that  represents  the  wafer 
and  to  accurately  track  the  evolution  of  the  material 
interfaces  as  they  change  with  time.  As  material  is 
deposited  or  etched  away  from  interfaces,  those 
tetrahedral  faces  (interface  triangles)  that  lie  on  the 
boundary  between  two  materials  move  in  time. 
From  a  computation  of  the  flux  of  materials 
arriving  at  the  surface  of  the  interface  triangles, 
we  move  triangle  nodes  after  apportioning  the  area 
of  each  triangle  to  each  of  the  nodes  it  shares.  The 
velocity  of  each  node  is  then  computed  from  an 


area-weighted  vector  average  of  the  contributions 
from  each  of  the  triangles  that  share  the  node. 

In  low  pressure  simulations,  the  transport  of 
material  between  the  boundary  plane  and  the 
surface  of  a  wafer  is  represented  by  straight-line 
trajectories  of  particles.  The  flux  of  materials 
arriving  at  a  triangle  on  the  surface  interface 
depends  on  the  source  emanation  rate,  the  distance 
between  the  source  and  the  surface  element,  the 
relative  orientation  of  the  source  and  the  surface 
element,  and  the  visibility  of  the  surface  as  viewed 
from  the  source.  In  the  simplest  model,  all  the 
material  arriving  at  the  wafer  surface  stays  there, 
implying  that  the  only  source  elements  are  those 
that  lie  on  the  boundary  plane.  However,  if  some 
vapor  phase  materials  arriving  at  the  surface  do 
not  react  there,  they  (or  other  species)  will  be  re¬ 
emitted,  and  in  such  cases,  all  interface  triangles 
serve  as  potential  source  elements.  By  specifying 
the  surface  chemistry  (using  the  ChemKin  and 
Surface  ChemKin  reaction  software  libraries), 
TopoSim3D  incorporates  the  calculation  of  “stick¬ 
ing  coefficients”  in  a  natural  way.  If  the  rate  of  a 
surface  reaction  is  slower  on  a  surface  element 


FIGURE  6  Topographic  deposition  onto  two  materials  with 
different  ‘sticking  coefficients’.  View  a  shows  the  solid  model  of 
the  wafer.  View  b  gives  the  initial  surface  grid  of  the  hole  with 
the  front  surface  of  the  hole  removed.  View  c  shows  an 
intermediate  time  step  in  the  deposition.  View  d  shows  the  final 
time  step  in  the  deposition.  Note  the  differences  in  upper  and 
lower  material  ‘sticking  coefficients’. 
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FIGURE  6  (Continued). 


than  the  arrival  rate  of  reactants,  mass  balance 
requires  the  excess  material  to  be  re-emitted  from 
that  interface  triangle,  which  in  turn  becomes  an 
effective  source  element  for  other  surface  elements. 

As  in  grain  growth  modeling,  the  X3D  grid 
maintenance  calls  are  used  to  refine,  reconnect  and 
smooth  the  interface  surface  during  the  time 
evolution.  TopoSim3D  uses  the  X3D  data  struc¬ 


tures  to  help  avoid  folding  problems  and  detect 
topological  events  such  as  a  pinch-off.  Figure  6 
shows  deposition  on  an  overhang  structure  with 
material-dependent  sticking  coefficients. 

CONCLUSION 

As  shown  in  the  three  examples  presented,  X3D 
grid  optimization  techniques  are  well  suited  to 
solving  time-dependent  and  geometrically  challen¬ 
ging  problems  occurring  in  semiconductor  appli¬ 
cations. 
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Molecular  Dynamics  (MD)  is  a  powerful  tool  for  the  atomistic  understanding  of  long- 
range  stress-mediated  phenomena,  phonon  properties,  and  mechanical  failure  of 
nanostructures.  For  realistic  modeling  of  nanostructures,  however,  the  scope  of 
simulations  must  be  extended  to  large  system  sizes,  long  simulated  times,  and  complex 
realism.  We  have  developed  new  multilevel  algorithms  and  physical  models  encom¬ 
passing  multiple  levels  of  abstraction:  i)  space-time  multiresolution  schemes;  ii)  adaptive 
curvilinear-coordinate  load  balancing;  iii)  hierarchical  dynamics  via  a  rigid-body/ 
implicit-integration/normal-mode  approach;  iv)  variable-charge  MD  based  on  electro¬ 
negativity  equalization;  and  v)  multilevel  preconditioned  conjugate  gradient  method. 
Fuzzy  clustering  is  used  to  facilitate  the  seamless  integration  of  the  multiple  levels  of 
abstraction. 


Keywords:  Molecular  dynamics,  parallel  computing,  multilevel  algorithms,  nanostructures 


As  new  fabrication  technologies  for  quantum 
nanostructures  emerge  (such  as  strain-induced 
self-organized  growth  [1]  and  substrate  encoded 
size-reduced  epitaxy  [2] ),  there  is  growing  need  for 
hybrid  atomistic/mesoscopic  computer  simula¬ 
tions.  Molecular  Dynamics  (MD)  [3]  is  a  powerful 
tool  for  the  atomistic  understanding  of  long-range 
stress-mediated  phenomena,  phonon  properties, 
and  mechanical  failure  of  nanostructures.  For 
realistic  modeling  of  nanostructures,  however,  the 
scope  of  simulations  must  be  extended  to  larger 
system  sizes,  longer  simulated  times,  and  more 
complex  realism  than  what  has  been  feasible  until 


recently.  In  this  paper  we  describe  various  new 
multilevel  algorithms  for  large-scale,  long-time 
MD  simulations. 

In  MD  simulations,  a  system  is  represented  by  a  set 
of  atomic  coordinates,  {xz-|  i  =  1, . .  TV},  where  N  is 
the  number  of  atoms.  Time  evolution  of  the  system  is 
governed  by  Newton’s  second  law  of  motion  [3], 

«.-^r  =  *(*),  (i) 

where  and  g i({xi})  = -dV/dXi  are  the  mass  and 
force  for  the  z-th  atom.  The  potential  energy 
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function  F({x/})  consists  of  a  sum  over  atomic 
pairs  and  triples  [4]. 

The  most  prohibitive  computational  problem  in 
MD  simulations  is  associated  with  the  Coulomb 
potential.  Because  of  its  long  range,  each  atom 
interacts  with  all  the  other  atoms  in  the  system. 
Therefore  the  evaluation  of  the  Coulomb  potential 
requires  0(N2)  operations.  An  MD  algorithm  is 
developed  based  on  multiresolutions  in  both  space 
and  time  [5].  The  long-range  Coulomb  interaction 
is  computed  with  the  Fast  Multipole  Method 
(FMM)  [6,  7].  The  FMM  uses  the  truncated  multi¬ 
pole  expansion  and  local  Taylor  expansion  for  the 
Coulomb  potential  field  (see  Fig.  1).  By  computing 
both  expansions  recursively  on  a  hierarchy  of  cells, 
the  Coulomb  potential  is  computed  with  O(N) 
operations.  Short-  and  medium-range  non- 
Coulombic  interactions  are  computed  with  the 
Multiple  Time-Scale  (MTS)  approach  [8,9].  The 
MTS  method  is  based  on  the  fact  that  the 
farther  the  distance  between  particles  the  slower 
is  the  time  variation  of  forces.  Therefore  different 
time  steps  are  used  to  computer  forces  for  diffe¬ 
rent  interparticle  separations.  To  implement  this 
MultiResolution  Molecular  Dynamics  (MRMD) 
algorithm  on  parallel  computers,  we  use  spatial 
decomposition.  Processors  are  logically  orga¬ 
nized  as  a  cubic  array  of  dimensions  Px  x 
x  Pz,  and  we  partition  the  simulation  system  into 
subsystem  into  subsystems  of  equal  volume 
accordingly  (Fig.  2).  For  a  4.2  million-atom  Si02 
system,  one  MD  step  takes  only  4.8  seconds  on  the 
512-node  Intel  Touchstone  Delta  machine  [5],  The 
memory-bound  parallel  efficiency  [10]  of  the 
program  is  0.92  and  the  communication  overhead 
is  8%. 

Simulation  of  nanostructures  is  often  character¬ 
ized  by  irregular  atomic  distribution.  One  practical 
problem  in  simulating  such  irregular  systems  on 
parallel  computers  is  that  of  load  imbalance  [11]. 
Because  of  the  irregular  distribution  of  atoms,  the 
uniform  spatial  decomposition  results  in  unequal 
partition  of  workloads  among  processors.  As  a 
result  the  parallel  efficiency  is  degraded  signifi¬ 
cantly. 


Long-range  Coulomb  Short-range 


FIGURE  1  Schematic  representation  of  spatial  multiresolu¬ 
tion  for  a  two-dimensional  system.  (Left)  A  hierarchy  of  cells  in 
the  fast  multipole  method.  (Right)  The  direct  forces  on  a 
particle  (solid  circle)  are  due  to  the  primary  (open  circles  within 
the  hatched  area),  secondary  (open  circles  within  the  shaded 
area),  and  tertiary  (the  other  open  circles)  neighbor  atoms. 


FIGURE  2  Regular  spatial  decomposition  scheme  for  parallel 
computing.  (Left)  The  physical  system  is  divided  into  sub¬ 
systems  of  equal  volume.  Spheres  and  planes  represent  atoms 
and  subsystem  boundaries,  respectively.  (Right)  Each  subsys¬ 
tem  is  mapped  onto  a  computing  node  in  a  parallel  computer. 


To  avoid  this  problem,  a  dynamic-load-balan¬ 
cing  capability  is  added  to  the  MRMD  program 
[12].  The  new  load-balancing  scheme  introduces  a 
curvilinear  coordinate  system  [13],  £,  which  is 
related  to  the  atomic  coordinate,  x,  by  a  mapping, 

£  =  *  +  X]xeexp(/Qx).  (2) 

Q 

Workloads  are  partitioned  with  a  uniform  3- 
dimensional  mesh  in  the  curvilinear  coordinate 
system.  The  variational  parameters  {xq}  are 
chosen  to  minimize  the  load-imbalance  and  com¬ 
munication  costs.  Simulated  annealing  is  used  to 
solve  the  optimization  problem.  For  an  irregular 
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nanocluster-assembled  material,  the  load-balan¬ 
cing  scheme  sped  up  simulations  by  a  factor  of  4.2 
[12]. 

Many  important  material  processes  (e.g.,  sinter¬ 
ing  and  sol-gel  processes)  are  characterized  by  time 
scales  that  are  many  orders-of-magnitude  larger 
than  atomic  time  scales  (10-15sec).  A  new 
algorithm  is  developed  for  large-scale,  long-time 
MD  simulations  by  combining  a  hierarchy  of 
subdynamics  (Fig.  3)  [14].  Equation  (1)  is 
numerically  integrated  using  a  reference  system 
rI5  which  represents  two  types  of  essential  dy¬ 
namics,  i.e.,  global  conformational  changes  and 
fast  atomic  oscillations.  The  reference  system  is 
thus  defined  as  a  superposition,  r;  =  r0;  +  r^, 
where  r0i  and  rhi  are  the  collective  and  harmonic 
parts,  respectively. 

The  collective  part  of  the  reference  system 
represents  the  rigid-body  motion  of  clusters.  We 
use  the  quaternion  formulation  of  rigid-body 
dynamics  in  order  to  avoid  the  numerical  singu¬ 
larity  associated  with  angular  coordinates  [15].  A 
large  time  step  (10“12  sec)  is  used  for  the  numerical 
integration.  The  harmonic  part  rhl-  of  the  reference 


FIGURE  3  Various  physical  processes  involved  in  the  sin¬ 
tering  of  nanoclusters,  i)  Relative  rotation  of  clusters  is  included 
by  rigid-body  dynamics  with  fuzzy  clustering;  ii)  anharmonic 
atomic  motions  lead  to  surface  diffusion  and  the  growth  of  the 
neck  between  clusters,  and  these  motions  are  included  by 
implicit  integration  of  Newton’s  equations;  iii)  thermal  atomic 
motions  assist  the  above  diffusion  process,  and  these  high- 
frequency  motions  are  dealt  with  through  the  normal-mode 
analysis. 


system  represents  the  fast  oscillation  of  each  atom 
around  the  local  potential  minimum,  and  its 
equation  of  motion  can  be  integrated  analytically 
in  terms  of  trigonometric  functions  [16].  The 
residual  system,  defined  as  Z/-x*  -  rf,  is  expected 
to  vary  slowly,  since  the  rapidly  oscillating 
harmonic  motions  have  been  subtracted.  There¬ 
fore  its  equation  is  integrated  by  an  implicit 
integration  scheme  using  At  which  is  much  larger 
than  atomic  time  scales  [17].  The  integration 
scheme  is  stable  for  an  arbitrarily  large  At,  and 
it  is  also  symplectic  [17].  Symplectic  integrators 
preserve  the  phase-space  volume,  and  this  is 
essential  for  the  long-time  stability  of  orbitals. 

The  greatest  challenge,  however,  is  to  integrate 
these  heterogeneous  abstraction  levels  into  a 
seamless,  unified  scheme.  To  facilitate  such  inte¬ 
gration,  we  find  it  useful  to  introduce  the  concept 
of  fuzzy  clustering  [18, 19].  We  introduce  a  mem¬ 
bership  function  P  (i  e  c)  which  describes  the 
degree  of  association  between  atom  i  and  cluster 
c.  The  principle  of  maximum  entropy  is  used  to 
determine  P(iec).  The  fuzzy-body/Implicit-inte- 
gration/Normal-mode  (FIN)  scheme  sped  up  a 
simulation  of  nanocluster  sintering  by  a  factor  of 
28  over  a  conventional  explicit  integration  scheme, 
without  loss  of  accuracy.  A  parallel  implementa¬ 
tion  of  the  scheme  achieves  an  efficiency  of  0.94  for 
a  12.7  million-atom  nanocrystalline  solid  on  64 
nodes  of  an  IBM  SP2  computer  [14]. 

Conventional  interatomic  potential  functions 
used  in  MD  simulations  are  often  fitted  to  bulk 
solid  properties,  and  they  are  not  transferable  to 
systems  containing  defects,  cracks,  surfaces,  and 
interfaces.  In  these  systems,  the  partial  charges  on 
the  atoms  vary  dynamically  according  to  the 
change  in  the  local  environment.  This  environ¬ 
ment-dependent  charge  distribution  is  crucial  for 
the  physical  properties  of  these  systems  including 
the  fracture  toughness.  Transferability  of  intera¬ 
tomic  potentials  is  greatly  enhanced  by  incorpor¬ 
ating  variable  atomic  charges  which  dynamically 
adapt  to  the  local  environment.  Atomic  charges 
can  be  determined  by  equalizing  electronegativity 
[20]. 
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However,  the  increased  physical  realism  in  the 
variable-charge  MD  is  accompanied  by  increased 
computational  cost  for  minimizing  the  electro¬ 
static  energy  at  every  MD  step.  This  minimization 
is  equivalent  to  the  electronegativity  equalization 
condition  that  the  chemical  potentials  be  equal  for 
all  the  atoms.  This  condition  leads  to  a  linear 
equation  system  for  atomic  charges,  { qt }: 

=  /i  —  Xii  (3) 

j 

where  My  denotes  the  Coulomb-interaction  ma¬ 
trix,  Xi  is  the  electronegativity,  and  the  Lagrange’s 
multiplier  [i  is  determined  from  the  charge- 
neutrality  constraint. 

A  Multilevel  Preconditioned  Conjugate-Gradi¬ 
ent  (MPCG)  method  is  developed  for  this 
minimization  problem  by  splitting  the  Coulomb- 
interaction  matrix  into  short-  and  long-range 
components:  M  =  Ms  +  Mi  [21].  The  short-range 
matrix  is  the  contributions  from  atomic  pairs  (ij) 
within  the  nearest  neighbor  leaf  cells  used  in  the 
FMM  [5-7].  The  sparse  short-range  matrix  Ms  is 
used  as  a  preconditioner  to  improve  the  spectral 
property  of  the  linear  system  and  thereby  accel¬ 
erating  the  convergence  [22].  For  a-Al203  crystal, 
the  preconditioner  reduces  the  execution  time  to 
achieve  the  same  convergence  level  by  20%  [21]. 
Numerical  tests  involving  up  to  26.5  million  atoms 
are  performed  on  an  IBM  SP2  computer.  Figure  4 
shows  the  parallel  efficiency  (solid  lines)  and 
communication  overhead  (dashed  lines)  as  a 
function  of  the  number  of  atoms.  The  results  with 
and  without  preconditioning  are  denoted  by  circles 
and  squares,  respectively.  For  the  largest  system, 
the  preconditioning  improves  the  parallel  effi¬ 
ciency  from  0.92  to  0.95  [21].  The  communication 
overhead  of  the  MPCG  scheme  is  5%  of  the  total 
execution  time  for  the  largest  system.  The  multi¬ 
level  preconditioning  scheme  enhances  the  locality 
of  computation  by  extensively  using  the  short- 
range  interaction  matrix  Ms,  and  consequently  the 
program  runs  efficiently  on  parallel  platforms. 


FIGURE  4  Memory-bound  parallel  efficiency  of  the  Multi¬ 
level  Preconditioned  Conjugate  Gradient  (MPCG)  program 
(solid  lines)  as  a  function  of  the  number  of  atoms.  Open  circles 
and  open  squares  are  the  results  for  the  MPCG  and  the 
Conjugate  Gradient  (CG)  methods,  respectively.  Communica¬ 
tion  overheads  of  the  same  program  are  shown  by  the  dashed 
lines. 


In  summary,  we  have  developed  various  parallel 
multilevel  algorithms  for  multiscale  phenomena  in 
nanostructures.  Using  these  algorithms,  multi¬ 
million-atom  MD  simulations  are  being  performed 
for:  i)  nanocluster-assembled  Si3N4  [23,  24],  Si02, 
SiC,  and  A1203;  ii)  Si/Si3N4  and  A1/A1203  inter¬ 
faces;  and  iii)  GaAs  stepped  surfaces  and  mesas. 
Mechanical  properties  including  fracture,  long- 
range  stress-mediated  phenomena,  and  phonon 
properties  in  these  nanostructures  are  being 
investigated. 
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We  have  been  using  a  self-consistent  formulation  of  full-wave  electromagnetic  solvers  and 
ensemble  Monte  Carlo  techniques  to  model  ultrafast  photoconductivity.  Our  simulations 
are  running  on  a  MasPar  machine.  This  paper  will  address  aspects  of  this  simulation  which 
may  interest  workers  who  are  simulating  not  only  photoconductive  systems  but  other 
systems  as  well  which  involve  electrodynamics,  waves  and  wave  phenomena  and  ensemble 
Monte  Carlo  transport  models.  In  particular,  we  will  report  on  the  inclusion  of  perfectly 
matched  layer  approaches  to  absorbing  boundary  conditions  for  electromagnetic  waves. 
These  have  in  the  past  several  years  become  widely  used  in  computational  electro¬ 
magnetics  codes  because  they  reduce  error  due  to  spurious  numerical  wave  reflection  off  of 
an  absorbing  boundary  by  several  orders  of  magnitude.  We  will  also  address  the  issue  of 
computational  cost  and  show  that  a  full-wave  electromagnetic  approach  is  more 
competitive  with  a  Poisson’s  equation  approach  than  one  might  believe.  Lastly,  our 
system  has  the  feature  that  the  active  portion  where  the  electrons  and  holes  lie  is  in  fact  a 
small  fraction  of  the  total  experimental  system’s  volume.  Unless  care  is  exerted  one  either 
has  a  very  significant  load  imbalance  problem  or  high  communications  overhead.  We 
compare  two  different  tradeoffs  between  load  imbalance  and  communications  overhead. 


Keywords:  Monte  Carlo,  electrodynamic,  full-wave,  parallel  processor,  PML,  wave-absorbing 
boundaries 


INTRODUCTION 

Semiconductor  device  models  always  incorporate 
electromagnetic  forces,  usually  by  a  quasistatic 
field  calculation.  The  electric  field  is  allowed  to 


vary  in  time  but  its  spatial  derivatives  are  assumed 
to  be  electrostatic  in  nature,  that  is 

V«E =p/e  (1) 
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and 

V  x  E  =  0.  (2) 

The  irrotational  electric  field  in  equation  (2)  is 
the  essential  approximation  being  made.  In  con¬ 
trast,  one  can  use  an  electrodynamic  or  full-wave 
electromagnetic  model  [1-8]  in  which  one  solves 
Maxwell’s  curl  equations 

V  x  E  =  -dB/dt  (3) 

VxH  =  J  +  dD/dt.  (4) 

In  this  paper  we  will  discuss  this  issue  using  a 
system  where  the  transport  model  is  an  ensemble 
Monte  Carlo  model.  We  will  discuss  load  imbal¬ 
ance/communications  overhead  problems  asso¬ 
ciated  with  the  parallelization  of  such  a  system, 
show  that  a  full-wave  electromagnetic  approach  is 
more  cost  competitive  than  one  might  believe  and 
will  report  on  the  inclusion  of  perfectly  matched 
layer  approaches  to  absorbing  boundary  condi¬ 
tions  for  electromagnetic  waves. 

TRADEOFFS  BETWEEN  QUASI-STATIC 
AND  FULL-WAVE  MODELS 

The  physical  motivation  for  switching  to  a  full- 
wave  model  of  a  semiconductor  device  may  be 
either  internal  or  external  to  the  device.  Our 
devices  are  often  incorporated  into  a  larger  system. 
In  some  cases,  e.g.  when  computing  broadband 
response  in  the  millimeter-wave  and  submilli¬ 
meter-wave  regions,  electrodynamic  models  of 
the  external  components  are  used.  Incorporating 
full-wave  techniques  into  a  device  model  is  then 
quite  natural. 

The  internal  physical  issue  is  when  does  a 
quasistatic  approach  to  the  internal  field  calcula¬ 
tion  break  down?  The  typical  quasi-static  device 
model  solves  Poisson’s  equation 

V2F=-p/£  (5) 

to  determine  a  scalar  potential  function  V.  Fields 
and  forces  are  then  computed  by  taking  the 


gradient  of  this  scalar  potential  i.e. 

E  =  —  W.  (6) 

The  boundary  value  problem  associated  with 
Poisson’s  equation  is  computationally  demanding. 
The  gradient  operation  of  equation  (6)  on  the 
other  hand  is  computationally  trivial.  Poisson’s 
equation  can  always  be  used,  even  in  a  fully 
electrodynamic  setting,  if  one  makes  the  appro¬ 
priate  selection  of  gauge  [9,  10].  What  fails  in 
electrodynamic  setting  is  the  computationally 
trivial  gradient  operation  of  equation  (6).  Instead, 
we  must  additionally  solve  for  the  vector  potential 
and  compute  the  electric  field  using 

E  —  —  VF—  dA/dt.  (7) 

Equation  (6)  implies  the  irrotational  field  of 
equation  (2)  but  equation  (7)  does  not. 

Another  part  of  the  tradeoff  between  quasistatic 
and  full-wave  approaches  is  the  issue  of  computa¬ 
tional  cost.  Before  dealing  with  this  issue,  a  more 
detailed  discussion  of  our  computational  system  is 
needed.  We  have  been  implementing  this  system 
on  a  MasPar  MP-2  machine.  This  machine  has 
8192  Processing  Elements  (PE)  connected  in  a 
mesh.  It  is  a  SIMD  machine  with  two  routes  for 
data  transfer  between  elements.  The  fastest  com¬ 
munication  is  between  nearest  neighbor  PEs  on  a 
network  called  the  Xnet.  The  slower  route  is  a 
global  router  which  communicates  between  any 
two  PEs.  As  described  before  [1  -3],  we  will  couple 
two  different  calculations  together.  We  self-con- 
sistently  couple  an  Ensemble  Monte  Carlo  (EMC) 
model  with  a  finite  difference  solution  of  Max¬ 
well’s  curl  equations.  These  two  calculations 
involve  different  types  of  data.  The  EMC  calcula¬ 
tion  has  data  which  is  associated  with  a  specific 
particle  while  the  electromagnetic  calculation  has 
data  which  is  associated  with  a  location  in  space. 
There  are  natural  methods  for  parallelizing  both 
computations  but  these  methods  differ  in  their 
utilization  of  the  PE  array  due  to  this  difference  in 
data  type.  In  the  EMC  calculation  one  wishes  to 
evenly  distribute  carries  over  the  PE  array  while  in 
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the  field  computation  one  assigns  adjoining 
regions  of  space  to  nearest  neighbors  PEs.  Our 
problem  is  to  simultaneously  parallelize  both 
solutions. 

Two  basic  strategies  for  this  simultaneous 
parallelization  will  be  compared  here.  In  both 
algorithms,  we  compute  the  fields  using  the 
adjacent  PE  approach  mentioned  above.  (The 
initial  condition  for  the  fields  is  produced  by 
solving  Poisson’s  equation  using  a  red/black  SOR 
technique  [11]).  In  algorithm  1,  an  equal  number 
of  particles  is  assigned  to  each  processor  thus 
alleviating  any  load  imbalance  in  the  EMC 
calculation  but  a  high  price  is  paid  for  commu¬ 
nication  between  the  PEs  during  the  process  where 
we  switch  between  the  two  calculations.  In 
algorithm  2,  a  region  of  space  is  assigned  to  a 
single  processor  even  during  the  EMC  calculation. 
This  processor  handles  the  particles  found  in  this 
region.  This  minimizes  communication  overhead 
but  introduces  a  significant  load  imbalance  as  the 
EMC  region  is  only  a  small  part  of  the  simulation 
domain.  In  order  to  alleviate  this  load  imbalance, 
the  field  information  for  just  the  EMC  region  is 
duplicated  throughout  the  PE  array  so  that 
carriers  can  be  located  on  any  PE  which  has  the 
appropriate  field  information. 

We  apply  this  to  the  physical  system  shown  in 
Figure  1.  A  nonuniform  grid  was  used.  The  grid 
used  a  2  micron  spacing  in  all  three  dimensions  in 
the  passive  region  and  a  0.1  micron  spacing  in  the 
EMC  region.  There  is  a  gradual  transition  between 
the  two  extremes.  The  run  times  of  these  two 
algorithms  are  shown  in  Table  I.  They  are  broken 
down  into  entries  for  the  field  calculation,  the  MC 
calculation  and  the  coupling  of  these  two.  As 
expected,  algorithm  1  spent  less  time  on  the  EMC 
calculation  due  to  better  load  balance  but  more 
time  in  coupling  the  two  calculations  due  to  the 
communication  overhead.  Over  all,  algorithm  1 
was  somewhat  superior  to  algorithm  2  for  this 
particular  example.  The  performance  of  the 
algorithms  though  is  sufficiently  close  that  no 
conclusion  should  be  reached  that  algorithm  1  is 
always  superior  to  algorithm  2. 


Now,  how  does  a  quasistatic  computation 
compare  in  numerical  cost  with  a  full  wave 
approach?  Since  we  have  both  a  Poisson  solver 
and  a  full-wave  model  here,  we  can  compare  the 
two.  We  duplicate  the  computation  just  described 
only  we  proceed  quasistatically.  (Of  course,  a 
quasistatic  solution  of  a  transmission  line  transient 
response  in  unphysical  but  our  goal  is  to  compare 
computational  costs).  This  run-time  data  is  also 
shown  in  Table  I  along  with  the  number  of 
iterations  required  per  time  step.  The  full- wave 
model  uses  1  iteration  per  time  step  while  the 
number  of  iterations  per  timestep  in  the  Poisson’s 
solver  depends  on  the  convergence  criteria.  If  too 
many  iterations  are  required  per  timestep,  the 
quasi-static  model  loses  any  cost  advantage  it  may 
have  had  over  a  full-wave  solution.  In  fact,  for  a 
very  tight  convergence  criteria,  the  quasistatic 
model  is  signficantly  more  expensive  than  the 
full-wave!. 

Examination  of  Figure  1  shows  that  we  are 
simulating  an  open  system  with  electromagnetic 
waves  propagating  away  towards  infinity.  As  our 
grid  is  finite  in  size  we  need  to  minimize  the 
numerical  reflection  of  these  outgoing  waves  at  the 


FIGURE  1  The  system  simulated  here  is  a  microstrip  line 
structure  on  a  GaAs  substrate.  A  dc  bias  is  applied  and  the 
system  is  which  will  be  excited  by  a  subpicosecond  pulse  in  the 
Monte  Carlo  or  MC  region  shown.  An  electro-optic  material  is 
place  over  the  top  of  the  system.  The  grid  size  for  the 
electromagnetics  code  was  127  x  63  x  54.  8192  electrons  and 
8192  holes  were  included  in  the  Monte  Carlo  calculations. 
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TABLE  I  Computational  run  times  for  various  solution  techniques 


Method 

Average 

Iteration 

Field 

(sec/lOOOiter) 

Coupling 

(sec/lOOOiter) 

MC 

(sec/lOOOiter) 

Total 

(sec/lOOOiter) 

Dynamic 
algorithm  1 

1 

589.4 

814.6 

40.1 

1444.1 

Dynamic 
algorithm  2 

1 

588.8 

335.8 

823.6 

1748.2 

Static 

error:  1E-3V 

1.98 

354.5 

380.7 

35.8 

771.0 

Static 

error:  1E-4V 

11.8 

975.6 

378.6 

40.6 

1394.8 

Static 

error:  1E-5V 

65.4 

4375.8 

379.7 

39.5 

4795.0 

end  of  our  numerical  grid.  There  has  been  notice¬ 
able  advance  in  such  absorbing  boundary  condi¬ 
tions  for  electromagnetic  waves  in  the  past  several 
years,  usually  associated  with  the  phrase  “Perfectly 
Matched  Layer”  or  PML  [12-18].  These  techni¬ 
ques  have  reduced  the  numerical  error  associated 
with  spurious  numerical  reflection  by  several 
orders  of  magnitude  and  are  particularly  advanta¬ 
geous  for  the  case  of  oblique  incidence.  We  use  the 
technique  of  Berenger  [12-14]  although  other 
techniques  have  been  developed  as  well  [15-18]. 
Here  we  will  outline  the  conceptual  nature  of  these 
techniques  and  refer  the  reader  to  the  references 
for  details. 

The  essential  idea  is  to  add  a  fictitious  “Perfectly 
Matched  Layer”  on  the  outside  of  all  surfaces 
where  we  have  to  implement  a  wave  absorbing 
boundary.  We  then  extend  our  numerical  grid  out 
into  this  fictitious  layer,  terminating  it  at  the  end  of 
this  layer.  Since  this  is  a  purely  artifical  layer,  we 
are  free  to  pick  its  material  properties  to  meet  two 
constraints.  First,  the  reflection  coefficient  asso¬ 
ciated  with  the  interface  between  the  valid  solution 
space  and  the  fictitious  absorbing  space  must  be 
zero  for  all  frequencies  and  all  angles  of  incidence. 
This  will  eliminate  numerical  reflection  off  of  this 
interface.  However,  accomplishing  this  goal  just 
moves  the  original  problem  to  the  new  truncation 
of  the  grid  at  the  far  end  of  the  fictitious  absorbing 
layer.  We  will  have  a  reflection  off  of  this 
truncation.  Therefore  our  second  constraint  on 
the  PML  is  that  it  must  be  a  lossy  medium  which 
attenuates  electromagnetic  waves.  The  techniques 


described  in  the  references  succeed  in  meeting  both 
constraints. 

In  summary,  one  can  proceed  with  an  electro¬ 
dynamically  based  approach  in  the  modeling  a 
semiconductor  device.  For  the  cases  where  one  is 
already  expending  significant  computational  re¬ 
sources  e.g.  three  dimensional  device  models, 
quasistatic  and  electrodynamic  techniques  do  not 
necessarily  differ  significantly  in  computational 
cost.  If  one  does  so,  the  use  of  a  PML  type  of 
absorbing  boundary  is  recommended.  Care  how¬ 
ever  may  be  required  if  one  implements  the  system 
on  a  massively  parallel  processor  array  as  the  field 
calculation  and  the  transport  model  may  employ 
very  different  types  of  data  and  additionally,  the 
device  itself  may  be  a  comparatively  small  portion 
of  the  simulation  domain. 
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In  this  paper,  we  present  a  mesoscopic-macroscopic  model  of  self-consistent  charge 
transport.  It  is  based  upon  an  asymptotic  expansion  of  solutions  of  the  Boltzmann 
Transport  Equation  (BTE).  We  identify  three  dimensionless  parameters  from  the  BTE. 
These  parameters  are,  respectively,  the  quotient  of  reference  scales  for  drift  and  thermal 
velocities,  the  scaled  mean  free  path,  and  the  scaled  Debye  length.  Such  parameters 
induce  domain  dependent  macroscopic  approximations.  Particular  focus  is  placed  upon 
the  so-called  high  field  model,  defined  by  the  regime  where  drift  velocity  dominates 
thermal  velocity.  This  model  incorporates  kinetic  transition  layers,  linking  mesoscopic 
to  macroscopic  states.  Reference  scalings  are  defined  by  the  background  doping  levels 
and  distinct,  experimentally  measured  mobility  expressions,  as  well  as  locally 
determined  ranges  for  the  electric  fields.  The  mobilities  reflect  a  coarse  substitute  for 
reference  scales  of  scattering  mechanisms.  See  [9]  for  elaboration. 

The  high  field  approximation  is  a  formally  derived  modification  of  the  augmented 
drift-diffusion  model  originally  introduced  by  Thornber  some  fifteen  years  ago  [25].  We 
are  able  to  compare  our  approach  with  the  earlier  kinetic  approach  of  Baranger  and 
Wilkins  [5]  and  the  macroscopic  approach  of  Kan,  Ravaioli  and  Kerkhoven  [20]. 


Keywords:  Asymptotic  parameters,  mesoscopic-macroscopic  model,  augmented  drift-diffusion, 
high  field  model,  domain  decomposition 


1.  INTRODUCTION 

This  presentation  is  motivated  by  the  search  for 
more  reliable  macroscopic  models  of  high  field 
transport  in  submicron  structures  which  are 


computationally  efficient.  This  is  a  research  topic 
of  major  interest  to  the  micro-electronics  industry. 

Transport  in  submicron  structures  differs  from 
transport  in  bulk  material  in  many  ways.  It 
includes  far  from  equilibrium  situations  created 
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by  large  electric  fields  in  small  structures;  ballistic 
electrons,  when  dimensions  of  the  drift-forcing 
term  are  of  the  same  order  as  the  mean  free  path; 
and  new  effects,  introduced  by  the  close  proximity 
to  the  boundaries  of  regions  where  the  density 
gradients  are  (relatively)  rapidly  changing. 

Most  of  the  previous  work  on  transport  in 
submicron  semiconducting  structures  used  several 
corrections  to  drift-diffusion  equations  in  which 
the  current  is  a  sum  of  drift  and  diffusion  terms, 
where  both  have  field-dependent  mobility  and 
diffusion  coefficients,  respectively.  References  on 
models  and  computational  methods  for  N+  ~  N  - 
N+  structures  of  Si  or  GaAs  can  be  found  in  the 
literature  of  the  last  decades  [24,  23,  16,  21,  25,  20]. 
However,  the  computations  have  not  yielded 
results  which  have  been  genuinely  satisfactory, 
and  the  main  reason  has  been  the  lack  of  under- 
stading  of  the  accuracy  of  the  modeling  in  many 
heterogeneous  structures.  An  alternative  to  the 
drift-diffusion  equations  has  been  Monte  Carlo 
simulation,  a  very  costly  computational  procedure. 

An  approach  intermediate  between  drift-diffu¬ 
sion  and  Monte  Carlo  simulation  was  generated  by 
the  hydrodynamic  like  theory,  using  velocity  mo¬ 
ments  of  the  Boltzmann  equation  (see  [6,  2,  3,  15]) 
and  realizing  closure  by  imposing  constitutive 
relationships  between  the  state  macroscopic  vari¬ 
ables.  Extensive  numerical  modeling  of  N+  -N~~ 
N+  Si  and  GaAs  structures  under  several  geometric 
constraints  can  be  found  in  [12,  17,  18,  19].  For  the 
derivation  of  these  models,  the  distribution  function 
is  often  assumed  to  be  a  Maxwellian  distribution. 
However,  this  assumption  has  been  shown  to  be 
questionable  in  lightly  doped  submicron  structures, 
in  part  because  of  the  presence  of  ballistic  electrons. 

However,  it  has  been  mathematically  proved 
that  hydrodynamic  models  relax  to  energy  trans¬ 
port  models.  The  latter  have  been  derived  as 
macroscopic  limits  associated  with  a  particular 
choice  of  space-time  scale  that  makes  some 
collision  mechanisms  dominant  (see  [10]  for  a 
rigorous  proof  in  the  case  of  the  relaxation  of  the 
full  hydrodynamic  model).  These  models,  how¬ 
ever,  cannot  describe  situations  where  the  drift¬ 


forcing  term  and  the  resulting  scaled  mean  free 
path  are  of  the  same  order,  but  they  are 
asymptotically  correct  in  regions  where  the  pre¬ 
defined  background  is  very  slowly  varying  (i.e., 
almost  constant  doping).  A  new  approach  at¬ 
tempting  to  fill  in  the  middle  ground  among  drift- 
diffusion,  hydrodynamic  energy  transport,  and 
Monte  Carlo  simulation  (see  [4,  5,  26])  was  the 
use  of  direct,  numerical  solution  of  the  Boltzmann 
equation  within  the  relaxation  time  approxima¬ 
tion.  The  use  of  the  Boltzmann  equation  for 
non-electron  transport  is  justified  in  the  above 
references,  as  well  as  in  [7,  13].  The  relaxation  time 
approximation  within  the  Boltzmann-Poisson  sys¬ 
tem  approach,  positing  that  all  scattering  pro¬ 
cesses  can  be  characterized  by  a  few  scattering 
rates  setting  up  the  scale  of  relaxation  time 
constants,  may  not  be  strictly  valid  for  GaAs  [11]. 

Our  goal,  however,  is  to  present  a  full  solvable 
model  for  an  N+  -N-N+  structure,  as  an 
alternative  to  all  previous  calculations,  which  will 
produce  comparable  results  with  greater  efficiency 
than  attained  by  [4,  5,  20].  In  particular,  we 
incorporate  into  the  modeling  the  macroscopic 
limit  associated  with  the  choice  of  space,  time, 
potential  drop,  and  local  electric  field  scales, 
combined  with  the  relaxation  time  scale,  which 
makes  the  drift-forcing  term  and  collisions  both 
dominant,  and  of  the  same  scale  set  up  by  the  local 
scaled  mean  free  path. 

As  a  consequence  of  this  approach,  we  incorpo¬ 
rate  a  level  of  modeling  that  takes  into  account 
mesoscopic-macroscopic  multiscales,  and  corre¬ 
sponding  limit  derivations,  according  to  different 
scales  becoming  dominant  in  different  regions  of 
the  N+  -N-N+  device,  depending  on  the  spatial 
inhomogeneity  of  the  lightly  doped  region. 

2.  PRELIMINARIES  OF  THE  HIGH 
FIELD  MODEL 

We  identify  three  dimensionless  parameters,  viz., 

•  The  ratio  rj  of  drift  and  free  velocity  (the  latter 

usually  taking  on  the  thermal  velocity); 
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•  The  scaled  mean  free  path  e  and 

•  The  scaled  Debye  length  7  from  the  electrostatic 

potential  equation  of  a  self-consistent  model. 

The  dimensionless  parameters  induce  domain 
dependent  macroscopic  approximations,  which  are 
valid  within  regimes  that  do  not  change  the  scale 
of  the  Debye  length.  This  is  of  paramount 
importance  in  the  macroscopic  derivation  asso¬ 
ciated  with  the  BTE  under  a  self-consistent  field: 
the  scaling  of  the  forcing  term  depends  on  the 
scaled  external  and  internal  fields,  and  must  not 
override  the  limiting  regime  of  the  macroscopic 
derivation.  Therefore,  this  model  incorporates 
kinetic  transition  layers  linking  mesoscopic  and 
macroscopic  states. 

Reference  scalings  are  defined  by  the  back¬ 
ground  doping  levels  and  distinct,  experimentally 
measured  mobilities  which  serve  to  set  up  the  scale 
of  the  relaxation  time  approximation  of  the 
scattering  mechanisms.  In  addition,  scales  take 
into  account  locally  determined  ranges  for  the 
electric  fields.  Numerical  experiments  of  this 
proposed  model  are  also  presented  by  the  authors 
in  [8],  Although  this  is  a  first  approach  to  domain 
decomposition  models,  based  on  a  mesoscopic- 
macroscopic  linking,  our  results  seem  to  give  an 
improvement  (see  also  [13])  of  computational 
efficiency,  relative  to  the  kinetic  computations  of 
Baranger  and  Wilkins  [5].  Moreover,  the  model 
itself  seems  to  be  an  improvement  over  the 
augmented  drift-diffusion  model  used  by  Kan, 
Ravaioli  and  Kerkhoven  [20].  This  model,  based 
on  a  model  introduced  empirically  by  Thornber 
[25],  employs  necessary  corrections  to  the  usual 
macroscopic  approach  to  high  field  models. 


3.  BOLTZMANN-POISSON  SYSTEM 
SCALES 

Finally,  we  concentrate  on  the  derivation  of  the 
macroscopic  models  for  semiconductors,  based  on 
asymptotic  expansions  of  multiscale  dimensionless 
BTE-Poisson  systems.  For  the  sake  of  simplicity 


we  shall  consider  only  electrons.  The  semi  classical 
Boltzmann-Poisson  system,  within  a  parabolic 
band-relaxation  time  approximation,  may  be 
written: 

d,F+  v  ■  VXF--E(x,  t )  •  VvF= 

m  r 

E  =  -V0,  V  •  (£Vtf)  =  e((F)  -  Nd(x)),  (1) 

where  F(x,  v,  t)  is  the  density  function  for  the 
electron  at  position  x,  velocity  v  and  time  t.  The 
constants  e,  m,  r  and  e  represent  the  electric 
charge  constant,  its  mass,  the  relaxation  time 
reference  scale,  and  permittivity  of  the  material, 
respectively.  The  functions  <j>  and  E  of  (x,  t) 
represent  the  electrostatic  potential  and  its  field. 
The  bracket  (•)  denotes  the  usual  average  of  the 
distribution  function  with  respect  to  the  velocity 
variable.  Finally  M=M(v ,  x)  denotes  the  Max¬ 
wellian  centered  at  zero  velocity  with  0  =  6(x) 
representing  the  background  temperature  of  the 
lattice,  measured  in  units  of  specific  energy. 
Derivations  of  mesoscopic  limits  associated  with 
system  (1)  depend  on  a  particular  choice  of  space- 
time  scales,  collision  mechanism  scales,  and  drift 
velocity  scales  induced  by  self-referencing  field 
scales.  These  regimes,  as  we  shall  see,  can  make 
some  terms  of  the  BTE  dominant.  The  idea  that 
follows  then,  is  to  expand  the  solution  of  the  BTE 
about  the  solution  of  the  ground  equation  corre¬ 
sponding  to  the  dominant  terms.  Of  course,  this  will 
depend  on  the  form  of  scattering  mechanisms  that 
set  up  the  scales  of  the  collision  terms.  Lately,  a 
very  nice  survey  of  a  hierarchy  of  macroscopic 
models  for  semiconductors  under  limit  regimes 
that  correspond  to  dominant  collision  mechanisms 
has  been  presented  in  [1].  These  macroscopic 
models  correspond  to  classical  drift-diffusion  and 
so-called  energy  transport  models.  They  arise  from 
expansion  about  kernels  of  the  collision  operators, 
which  are  field  independent,  so  they  do  not 
incorporate  the  scales  of  the  external,  or  even 
self-induced  electric  field.  Dilfusions  are  field 
independent,  and  consequently  they  cannot  ap¬ 
proximate  the  ballistic  electron  distribution  func- 
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tions  that  have  been  observed  by  Barenger  and 
Wilkins  [4,  5].  After  the  work  initiated  by  [14] 
and  [22]  on  strong  forcing  scaling  for  external 
fields,  where  the  dominant  term  in  the  scaled  Eq. 
(1)  is  given  by  the  balance  of  the  force-drift  (< e/m ) 
E(x ,  /)VvFand  the  collision  terms,  the  following  3- 
scale  dimensionless  formulation  has  been  formu¬ 
lated  in  [9]  in  i  Euclidean  dimensions: 

nF,  +  v  •  VXF  +  5  S7X4>  -VvF=X-{{F)M-  F}, 

^  =  l(^(F)-Nd(x)^,  (2) 

where  e  —  is  the  scaled  mean  free  path  for  a 
length  scale  L  and  relaxation  time  r,  and  6  is  the 
reference  scale  of  the  background  energy.  The 
dimensionless  constant  77  =  U/Q 1/2,  where  V ,  in 
units  of  velocity,  is  given  by  22^,  takes  into 
account  the  reference  scale  of  the  local  electric 
field.  Finally,  the  dimensionless  constant  7  =  Mis¬ 
represents  the  scaled  Debye  length,  where  p  scales 
the  density  of  the  fixed  background  Nd(x)  . 

Now,  a  strong  force  regime  corresponds  to 
values  of  77  =  0(1),  7  =  0(1)  and  e<L .  In  this 
case,  system  (2)  is  obtained,  where  the  distribution 
function  F  is  scaled  with  the  drift  velocity  U 
(instead  of  the  low  field  scalings  that  use  the 
thermal  velocity  scale  for  F).  The  time  scale  is  then 
fixed  at  LU~X. 


4.  ASYMPTOTIC  LIMITS 

AND  THE  HIGH  FIELD  MODEL 

Clearly,  any  asymptotic  expansion  is  justified 
under  the  assumption  that  the  scaling  does  not 
break  down.  Thus,  any  expansion  of  the  BTE- 
Poisson  system  (2)  requires  that  the  scale  of  the 
internal  field  defined  by  the  Poisson  equation  (i.e., 
depending  on  the  value  of  7)  does  not  override  the 
scaling  of  the  BTE. 

p  is  the  reference  scale  of  the  predefined  Nd{x) 
and  will  change  scale  whenever  Nd(x)  does. 
Accordingly,  so  will  the  local  field.  Therefore,  as 


an  immediate  consequence  of  the  fact  that 
7  =  7 (p)  can  be  easily  checked  for  GaAs  (as  in 
[5,  20]),  the  dimensionless  quantities  vary  in  the 
N+  regions  from  the  N  regions.  Thus,  in  N+  re¬ 
gions,  77  =  e  =  o(l).  Because  of  scaling  times  and 
the  distribution  function  F9  accordingly,  (2)  yields 
the  Drift-Diffusion-Energy  Transport  regime  as 
the  relative  electric  field  remains  moderate  and  the 
collision  mechanisms  become  dominant. 

However,  in  the  N  region  it  is  computed  by  the 
authors  in  [8]  that  e«(l/7)  and  77  =  0(1).  Thus, 
while  7  remains  of  order  0(1),  it  has  been  shown 
that  densities  and  the  electric  field  satisfy  the 
equations, 

p,  +  V*/=0, 

J  =  _r[v;c(p((9  +  p2E ®  £))]  -  ppE 
+rp,EpV  •  (p,E)  -  Tl1P~  (PPE  ~  w), 

V  •  w  =  0,  curl  w  =  pE  (8)  Vp, 
E=~V<t>,  V-{eV</>)  =  (p-N<(x))9  (3) 

in  their  dimensionalized  formulation.  Here  p  is  the 
mobility.  The  new  variable  w  can  be  interpreted  as 
the  curl  of  a  magnetic  field,  associated  with  the 
high  electric  current.  This  model  is  a  correction  of 
the  one  proposed  by  Thornber  [25],  where  the 
diffusion  and  transport  coefficients  were  given 
empirically. 

We  point  out  that  in  this  relaxation  time 
approximation,  the  mobility  coefficient  is  given 
by  p  =  %  so  clearly  the  saturation  of  the  mobility 
depends  on  how  the  scattering  rates  scale  for  r 
changing  with  respect  to  the  electric  field.  The 
standard  assumption  based  on  curve  fitting  is  to 
take  n  =  C,/(  1  +  C2|£|2)1/2  (see  [16]). 

We  stress  that  this  regime  is  valid  only  for  the 
strong  force  scaling.  The  transition  regime  from 
weak  to  strong  forcing  scaling,  and  conversely, 
ought  to  be  done  by  carefully  considering  the 
corresponding  kinetic  transition  layers.  The  im¬ 
plementation  and  solution  of  this  transition 
problem  require  solution  of  boundary  value 
problems  at  the  kinetic  and  macroscopic  level  as 
well. 
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5.  DOMAIN  DECOMPOSITION 

AND  TRANSITION  LAYERS 

Appropriate  boundary  data  at  the  kinetic  level  and 
the  corresponding  limiting  data  for  the  mesoscopic 
approximation  are  presented  in  [9].  For  kinetic 
level  distribution  functions  there  are  prescribed 
exact  neutral  space  charge  conditions  at  contact 
boundaries  and  standard  insulating  boundary 
conditions  at  insulating  walls.  Then,  these  condi¬ 
tions,  together  with  the  asymptotic  approxima¬ 
tions,  yield  Dirichlet  boundary  conditions  at  the 
macroscopic  level,  with  a  correction  term  of  the 
order  of  the  asymptotic  parameter  (i.e.,  the  order 
of  the  scaled  mean  free  path  e).  The  contact 
condition,  at  the  kinetic  level,  on  a  space  boundary 
section  T  reads  [9]: 

r--{™-LF**)idz*  (4> 

on  v  •  n  <  0, 

where  n  denotes  the  outer  normal  with  respect  to 
the  boundary  section  T,  and  P  denotes  the  lowest 
order  term  in  the  expansion  for  either  weak  force 
scaling  (i.e.,  P  is  a  Maxwellian)  or  strong  force 
scaling  (i.e.,  P  peaks  asymmetrically  and  is  electric 
field  dependent). 

The  corresponding  macroscopic  condition  for 
the  inner  boundary  can  be  computed  explicitly  [9] 
and  yields 


p(x)  =  Nd(x)  +  o(e)K(x), 


K(x)  = 


(vF^)  •  njF-l 

(v  •  F(?))  -  n 


(5) 


regions.  Hence,  at  boundary  ohmic  contacts  on 
N+  regions,  condition  (5)  is  almost  a  neutral  space 
charge  condition  at  the  macroscopic  level,  as 
historically  has  been,  and  still  is  typically  pre¬ 
scribed.  However,  at  the  transition  layer  from  the 
iV+to  the  N  regime,  the  transfer  of  the  data  entails 
a  kinetic  computation  using  condition  (4);  then, 
one  prescribes  an  exact  background  density,  and 
thus  delivers,  at  the  next  level,  for  the  strong  force 
model,  a  condition  for  the  density  that  is  an  0(e) 
derivation  from  the  background  density.  This  fact 
can  actually  be  observed  in  the  computations  of 
Barenger  and  Wilkins  [4,  5], 

Numerical  implementations  of  strong-weak 
forcing  decomposition  are  presented  by  the 
authors  [8]  in  this  same  issue.  However,  we  use 
there  a  matching  with  hydrodynamic  computa¬ 
tions  of  well  accepted  performance.  In  Figure  3  of 
[8],  we  present  a  graph  of  77,  which  clearly 
correlates  with  the  suggested  domain  decomposi¬ 
tion.  Theoretical  justifications  and  alternative 
numerical  methods  of  these  domain  decomposi¬ 
tion  ideas  are  underway  and  will  be  presented  in 
future  work. 
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Here, 

F  ~  F®  +  eF <•>  +  0(e2),  F®  =  (. F)P , 

</->  =  [  fdv. 

Jvn<  0 

We  finally  remark  that  the  Ohmic  contacts  on 
N+  regions  correspond  to  a  local  scaled  mean  free 
path  e  of  much  smaller  order  than  in  the  N 
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Smooth  quantum  hydrodynamic  (QHD)  model  simulations  of  the  resonant  tunneling 
diode  are  presented  which  exhibit  enhanced  negative  differential  resistance  (NDR)  when 
compared  to  simulations  using  the  original  0(tr)  QHD  model.  At  both  300  K  and  77  K, 
the  smooth  QHD  simulations  predict  significant  NDR  even  when  the  original  QHD 
model  simulations  predict  no  NDR. 

Keywords:  Quantum  hydrodynamic  model,  resonant  tunneling  diode 


The  original  0(h2)  quantum  hydrodynamic 
(QHD)  equations  have  been  remarkably  successful 
in  simulating  the  effects  of  electron  tunneling 
through  potential  barriers  including  single  and 
multiple  regions  of  negative  differential  resistance 
and  hysteresis  in  the  current-voltage  curves  of 
resonant  tunneling  diodes.  However,  the  model 
relies  on  an  ad  hoc  replacement  of  derivatives  of 
the  potential  with  derivatives  of  the  logarithm  of 
the  electron  density  in  order  to  avoid  infinite 
derivatives  at  heterojunctions. 

Refs.  [1]  and  [2]  present  an  extension  of  the 
QHD  model  that  is  mathematically  rigorous  for 
classical  potentials  with  discontinuities — as  are 
present  at  heterojunction  barriers  in  quantumse- 
miconductor  devices.  The  stress  tensor  in  this 
“smooth”  QHD  model  actually  cancels  the  leading 


singularity  in  the  classical  potential  at  a  barrier 
and  leaves  a  residual  smooth  effective  potential 
with  a  lower  potential  height  in  the  barrier  region 
(see  Fig.  1). 

The  smooth  QHD  equations  have  the  same 
form  as  the  classical  hydrodynamic  equations: 

dn  d  ,  ,  .  /1N 

di+d^i(nUi)~°  () 

d  .  .  ,  d  ,  D  x  dV  mnuj 

_  {mnUj)  +  —  (rnnuiuj  -  P,)  =  -n—-  — 

(2) 

-df+d^SUiW~UjPiJ  +  qi) 

dV  (W-lnTt)  1  ’ 

=  -nut- - - - - - - 

oXi  rw 
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FIGURE  1  Smooth  effective  potentials  £/,  UB ,  and  UP  for 
0.1  eV  double  barriers  at  300  K.  x  is  in  A 


where  repeated  indices  are  summed  over  and 
where  n  is  the  electron  density,  u  is  the  velocity, 
m  is  the  electron  mass,  Py  is  the  stress  tensor,  V  is 
the  classical  potential  energy,  W  is  the  energy 
density,  q  is  the  heat  flux,  and  T0  is  the  ambient 
temperature.  Collision  effects  are  modeled  by  the 
relaxation  time  approximation,  with  momentum 
and  energy  relaxation  times  rp  and  rw.  These 
transport  equations  are  coupled  to  Poisson’s 
equation  for  the  electric  potential. 

Quantum  effects  enter  through  the  expressions 
for  the  stress  tensor  and  the  energy  density: 


P  ij  —  nTSij 


h2n  d2V 
4 mT  dxjdxj 


(4) 


w=\nJ ’+5mm<2+^V2F  (5) 

where  T  =  1 :3  is  the  electron  temperature  and  the 
“quantum”  potential  V  is 


[  d3X'( - ^ - - 

J  \n(/3-0')(0  +  0>)h2 

xexpl - - t(X'  -  x)2\v(X'). 

I  (/3-W  +  /3')^2  ) 


The  quantum  correction  to  the  classical  stress 
tensor  and  energy  density  is  valid  to  all  orders  of 
h2  and  to  first  order  in  f3V,  and  involves  both  a 
smoothing  integration  of  the  classical  potential  over 
space  and  an  averaging  integration  over  tempera¬ 
ture.  The  barrier  height  B  is  incorporated  into  the 
QHD  transport  equations  (l)-(3)  by  replacing 
V  — >  V  +  B.  (Poisson’s  equation  is  not  changed). 

We  define  the  ID  smooth  effective  potential  in 
the  momentum  conservation  equation  (2)  as  the 
most  singular  part  of  V-Pn: 


£/=  V  + 


a2  </2r 

4 mT  dx 2  ’ 


(7) 


The  double  integration  over  both  space  and 
inverse  temperature  provides  sufficient  smoothing 
so  that  the  Pn  term  in  the  smooth  effective 
potential  cancels  the  leading  singularity  in  the 
classical  potential  at  a  barrier. 

The  ID  steady-state  smooth  QHD  equations  are 
discretized  [3]  using  a  conservative  upwind  method 
adapted  from  computational  fluid  dynamics.  The 
discretized  equations  are  then  solved  by  a  damped 
Newton  method. 

There  are  two  contributions  to  the  quantum 
potential  V:  the  double  barrier  potential  and  the 
“self-consistent”  electric  potential  from  Poisson’s 
equation.  Note  that  second  derivatives  of  V  appear 
in  the  stress  tensor  and  energy  density,  which  then 
are  differenced  in  the  smooth  QHD  transport 
equations.  Thus  we  compute 


V"  =  V "  +  Vp. 


(8) 


VnB  is  just  computed  once  since  it  only  depends  on 
the  barriers  and  not  on  the  applied  voltage  or  state 
variables  («,  w,  T,  VP).  In  computing  V%  we  first 
use  Poisson’s  equation  to  obtain 


hr 


2  m/3 


t r(/3 -/?')(/? +  /3')^ 
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x  exp 


2m  (3 


(Nd(X'  +  x)  -n{X'  +  x)) 


X'2} 


where  ND  is  the  density  of  donor  ions.  Then  to 
efficiently  compute  the  convolution  (9),  we  take 
advantage  of  properties  of  the  Fourier  transform. 

We  present  simulations  of  a  GaAs  resonant 
tunneling  diode  with  Al^Gaj.^As  double  barriers 
at  300  K  (77  K).  The  barrier  height  B  is  set  equal 
to  0.1  (0.05)  eV.  The  diode  consists  of  n+  source 
(at  the  left)  and  drain  (at  the  right)  regions  with  the 
doping  density  ND  =  1018  cm-3,  and  an  n  channel 
with  Nd  =  5  x  1015  cm-3.  The  channel  is  200 
(250)  A  long,  the  barriers  are  25  (50)  A  wide,  and 
the  quantum  well  between  the  barriers  is  50  A 
wide.  Note  that  the  device  has  50  A  spacers 
between  the  barriers  and  the  contacts.  We  have 
chosen  parameters  to  highlight  differences  between 
the  original  and  smooth  QHD  models. 

Figure  1  illustrates  the  smooth  effective  poten¬ 
tials  UB  (for  the  barriers),  XJP  (the  Poisson  con¬ 
tribution),  and  U  (barrier  plus  Poisson  contribu¬ 
tions)  for  the  resonant  tunneling  diode  at  300  K  at 
the  voltage  V  =  0.056  where  the  I-  V  curve  peaks 
in  Figure  2. 

Smooth  QHD  simulations  of  the  resonant 
tunneling  diode  exhibit  enhanced  negative  differ¬ 
ential  resistance  when  compared  to  simulations 


using  the  original  0{%2)  QHD  model.  The  current- 
voltage  curve  for  the  resonant  tunneling  diode  at 
300  K  is  plotted  in  Figure  2  and  at  77  K  is  plotted  in 
Figure  3.  It  is  interesting  that  the  original  0(h2) 
QHD  model  (see  Refs.  [4,  3]  and  references  therein) 
predict  very  different  /-  V  curves — in  fact,  at  both 
300  K  and  77  K  the  original  0{Ti2)  QHD  model  fails 
to  produce  negative  differential  resistance  for  these 
devices. 

Simulations  of  the  resonant  tunneling  diode  using 
the  Wigner-Boltzmann/Poisson  equations  are 
planned  to  determine  which  of  the  QHD  models 
gives  better  agreement  with  the  more  complete 
quantum  kinetics.  In  these  comparisons,  we  will  use 
Fokker-Planck  collision  terms  with  a  relaxation 
time  r  in  the  Wigner-Boltzmann  equation,  which 
then  implies  momentum  and  energy  relaxation 
times  rp  —  r  and  rw  —  r/2  in  the  QHD  models. 

We  will  then  be  able  to  answer  the  question;  In 
what  parameter  range  and  how  accurately  do  the 
smooth  QHD  solutions  (first  three  moments)  reflect 
the  solutions  to  the  full  Wigner-Boltzmann  equa¬ 
tion?  There  should  be  a  technologically  important 
range  of  parameters  (device  size,  ambient  tempera¬ 
ture,  potential  barrier  height,  applied  voltage, 
semiconductor  material,  etc.)  in  which  the  smooth 
QHD  model  gives  solutions  and  I-  V curves  that  are 
very  close  to  those  given  by  the  full  Wigner- 
Boltzmann/Poisson  system. 


FIGURE  2  Current  density  in  kiloamps/cm2  vs.  voltage  for 
the  resonant  tunneling  diode  at  300  K.  The  solid  curve  is  the 
smooth  QHD  computation  and  the  dotted  line  is  the  0(H2) 
computation.  The  barrier  height  is  0.1  eV. 


FIGURE  3  Current  density  in  kiloamps/cm2  vs.  voltage  for 
the  resonant  tunneling  diode  at  77  K.  The  solid  curve  is  the 
smooth  QHD  computation  and  the  dotted  line  is  the  0(h2) 
computation.  The  barrier  height  is  0.05  eV. 
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Spherical  Harmonic  Modeling  of  a  0.05  pm 
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We  perform  a  rigorous  comparison  between  the  Spherical  Harmonic  (SH)  and  Monte 
Carlo  (MC)  methods  of  solving  the  Boltzmann  Transport  Equation  (BTE),  on  a  0.05  pm 
base  BJT.  We  find  the  SH  and  the  MC  methods  give  very  similar  results  for  the  energy 
distribution  function,  using  an  analytical  band-structure,  at  all  points  within  the  tested 
devices.  However,  the  SH  method  can  be  as  much  as  seven  thousand  times  faster  than  the 
MC  approach  for  solving  an  identical  problem.  We  explain  the  agreement  by  asymptotic 
analysis  of  the  system  of  equations  generated  by  the  SH  expansion  of  the  BTE. 


Keywords:  Device  Modeling,  Simulation,  Spherical  Harmonic,  Boltzmann,  Monte  Carlo 


1.  INTRODUCTION 

The  Spherical  Harmonic  (SH)  method  for  solving 
the  BTE  has  given  rise  to  considerable  contro¬ 
versy.  Researchers  developing  the  method  have 
reported  promising  results  [1-3].  Other  research¬ 
ers  have  questioned  whether  the  SH  method  can 
indeed  provide  the  distribution  function  [4].  Still 
others  have  called  for  a  study  of  various  methods, 
including  the  SH,  especially  under  conditions  of 
rapid  spatial  variations  [5]. 

In  this  paper,  we  respond  to  these  questions  by 
performing  a  rigorous  comparison  between  the  SH 
and  MC  methods  of  solving  the  Boltzmann 
equation,  while  using  a  0.05  pm  base  BJT  as  a 


test  vehicle.  We  find  the  SH  and  the  MC  methods 
give  very  similar  results  for  the  energy  distribution 
function,  using  an  analytical  band-structure ,  at  all 
points  within  the  tested  devices .  However,  the  SH 
method  can  be  as  much  as  seven  thousand  times 
faster  than  the  MC  approach  for  solving  an  identical 
problem . 

We  explain  the  agreement  between  SH  and  MC 
by  analysis  of  the  SH  system  of  equations. 
Asymptotic  analyses  show  that,  due  to  high 
scattering  rates,  high  order  SH  terms  decay 
quickly  enough  to  have  minimal  effect  on  the 
energy  distribution  function,  even  for  abrupt  large 
spatial  variations.  Additionally,  boundary  condi¬ 
tions  further  restrict  the  population  of  higher 
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order  terms,  thereby  facilitating  use  of  a  low  order 
SH  expansion. 

2.  OVERVIEW  OF  SPHERICAL 
HARMONIC  METHOD 

Since  the  main  focus  of  this  paper  is  to  demon¬ 
strate  and  explain  why  the  SH  method  can  give 
relatively  good  agreement  with  MC  simulations, 
we  begin  with  an  overview  of  the  SH  approach. 
First,  we  write  the  BTE  below: 

Jv*e.Vr/(?,£)+!  V,  <f>(r )  •  V*  /(?,  f ) 

=7TT3  £  / [f(r,k')Si(k',k) 
(z7r)  i  J 

-f(?,k)Si(k,k'  )\d3k' 

(1) 

where  (j>{r)  is  the  potential,  f(ryk)  is  the  electron 
distribution  function;  k  is  the  electron  wave- 
vector;  r  is  the  position  vector;  %  is  Planck’s 
constant,  and  the  sum  is  over  the  various  collision 
processes  represented  by  the  transition  rates  St. 
Now,  we  express  the  distribution  function  in  terms 
of  the  following  SH  expansion: 

mk) = £  (2) 

/=0  m=-l 

here  Yf(0,ip)  are  the  spherical  harmonic  basis 
functions,  6  and  ^  are  the  polar  and  azimuthal 
angles  of  the  wave- vector,  respectively.  ff(r,e) 
represents  the  expansion  coefficients.  The  eventual 
goal  of  the  approach  is  to  determine  the  coeffi¬ 
cients  and  thereby  the  distribution  function  for  the 
device.  We  showed  previously  that  by  substituting 
the  spherical  harmonics  expansion  for  the  dis¬ 
tribution  function  into  the  BTE,  while  taking 
advantage  of  the  recurrence  and  orthogonality 
relationships  between  spherical  harmonics  the 
LHS  of  the  BTE  can  be  transformed  into  an 
infinite  set  of  identical  expressions  for  the  coeffi¬ 
cients.  We  also  showed  the  if  we  take  scattering  to 
be  either  isotropic  or  elastic  (which  is  a  reasonable 


approximation  in  silicon),  and  make  use  of  the 
addition  theorem,  the  collision  integral  can  be 
transformed  into  a  relatively  simple  expression 
[1,  2].  Putting  the  LHS  and  the  collision  integral 
together  gives  the  following  equation  for  the  zero 
order  coefficient /$: 


E  v(£)  7T-  -  qEi{r) 


l-  W(e) 


*:}a 


jMl 


C(£) 


While  the  expressions  for  the  coefficients  for  />0 


+v(e)[^-^(F)(l+- 

MM 


(d  /-l7'(£)\1 
\<9£  2  j(e)  )\  ‘ 

^+L±im)]Ar 

9e+  2  7(e)  Al  ' r' 


where  v(£)  =  y/lm^e) / m j  f(e) ,  7  '(e)  =  drf{e)/de, 
7(5)  represents  the  dispersion  relation,  and  2?*(r)  is 
the  electric  field  in  the  i  direction.  The  sum  over  i 
represents  the  Cartesian  directions  of  a  2-D  device 
cross-section  in  real-space.  The  raising  and  low¬ 
ering  operators  af  which  we  developed  to  relate 
the  coefficients  are  [1]: 


«r/r 


aatfT  MCttr/w'-WC) 


n~  m  rm  .  -+  rn 
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1  _  V 
(£)  (2tt): 


f  ^-0,  e')h(e')de' : 
7 rj  J 

31 J  Sjp' ,e)h{e')de' 


COMPARING  SPHERICAL  HARMONIC/MONTE  CARLO 


149 


In  (5)  we  have  introduced  which  is  /'th 
coefficient  of  the  Legendre  polynomial  expansion 
of  the  z'th  type  of  scattering  mechanism  [2],  as  well 
as  the  density  of  states  h(s)  =  4V^7r//z3(m*3/271//2 
(e)7'(£)).  The  superscript  out  refers  to  a  process 
where  the  electron  is  scattered  out  of  the  state  k  to 
another  state  k1 \  while  the  superscript  in  represents 
scattering  into  k.  The  superscripts  =F  in  (5)  refer  to 
absorption  (— )  and  emission  (  +  )  of  energy 
respectively.  (For  elastic  processes  have  been 
eliminated,  since  there  is  no  energy  emission  or 
absorption.  In  cases  where  there  is  no  process  for 
energy  emission  (absorption),  the  superscript  + 
(— )  has  been  eliminated). 

While  a  system  of  equations  has  been  obtained 
to  arbitrarily  high  order,  we  must  truncate  the 
system  to  actually  obtain  a  solution.  Since  the 
scattering  rate  in  silicon  is  relatively  high,  and 
silicon  is  a  covalent  semiconductor,  a  relatively 
low  order  truncation  should  be  applicable.  (This 
truncation  will  be  verified  later  in  the  paper).  We 
therefore  use  an  expansion  which  includes  the  first 
four  spherical  harmonics  (7°,  Tj-1,  ^i)-  After 

truncating,  the  total  energy  H  is  used  as  the 
independent  energy  variable  as  opposed  to  the 
kinetic  energy  e,  ( H=E-q<f> )  [3].  We  can  then 
arrive  at  the  following  tractable  expression  for  the 
symmetrical  part  of  the  distribution  function 
throughout  the  device  [1,  2]: 


-V- 


73/2(i/  +  q<p)r(H  +  q<j)) 


VF00(x,y,B) 


1  ’(H+qcp) 

3(g  —  l)P2nm*5/2  /  71/2(g+g<^)7/(i?+#)\ 
2V2-JT n3pcjn  V  etiu„/KoT_l  J 

{  ^ j(H  +  q4>  -  hun)l'{H  +  q4>  -  hon) 
x  [F°(x,  j,  H  -  hujn)  -  ^rF°0(x,y,  H )] 

+  \J +  q&  +  hujn) 

x  7 '(H+q^+ttw^e^^F^y,  H+hu>n) 
-F°0(x,y,H)}}  (6) 


In  (6)  Dn  =  optical  phonon  deformation  potential; 
un  =  optical  phonon  vibrational  frequency;  = 


symmetrical  part  of  electron  distribution;  1/r  is 
the  total  phonon  scattering  rate. 


3.  BJT  SIMULATION  RESULTS 

AND  DISCUSSION  OF  TRUNCATION 

The  main  point  of  this  paper  is  to  demonstrate 
that  a  low  order  truncation  is  indeed  suitable  for 
determining  the  energy  distribution  function  in 
semiclassical  silicon  devices  under  the  spherical 
band  approximation.  To  show  this,  we  used  the 
above  formulation  to  simulate  a  0.05  pm  base  BJT. 
We  chose  the  BJT  because  it  represented  a  very 
aggressive  challenge,  with  electric  fields  pointing  in 
both  directions,  a  narrow  base  of  50  nm,  with 
fields  varying  as  rapidly  as  200  kV/cm  over 
distances  as  small  as  20  nm.  As  a  benchmark,  we 
also  simulated  the  same  device  using  the  Monte 
Carlo  approach.  The  input  parameters  (analytical 
band  structure,  intervalley,  optical  and  acoustic 
phonon  scattering)  were  identical  for  both  the  SH 
and  MC  calculations.  Since  the  main  objective  was 
to  determine  if  the  low-order  SH  method  could 
respond  to  the  rapid  field  variations  of  the  BJTs, 
we  performed  both  the  Monte  Carlo  and  the 
Spherical  Harmonic  calculations  as  post-processor 
simulations. 

We  show  some  example  results  below.  In 
Figures  la  and  lb  we  show  the  doping  profile 
and  resulting  electric  field  with  an  applied  poten¬ 
tial  of  Vbe~  1*0  V  and  KCB  =  3.0V.  In  Figures  2a 
and  2b  we  show  the  3-dimensional  result  for  the 
energy  distribution  function  calculated  along  the 
device.  From  a  qualitative  point  of  view,  both 
distributions  look  alike,  although  the  one  gener¬ 
ated  by  the  SH  approach  is  populated  over  the 
complete  range  of  the  calculation  whereas  the  MC 
results  does  not  have  data  for  high  energies  due  to 
the  time  consuming  stochastic  nature  of  the  MC 
process.  In  Figures  3a  and  3b  we  show  the  energy 
distribution  in  more  detail  by  plotting  its  values  at 
regular  intervals  along  the  device.  In  the  plots,  we 
compare  the  SH  results  (solid  lines)  with  the  MC 
results  (open  circles).  Clearly,  the  SH  and  MC 
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FIGURE  1  The  doping  concentration  (top)  and  electric  field 
(bottom)  for  the  0.05  pm  /7-base  Gaussian  doped  BJT  with 
FBE  =1-0  Yand  VCB  =  3.0  V. 


FIGURE  2  The  calculated  electron  distribution  function  by 
SH-BTE  (top)  and  MC  (bottom)  for  the  0.05  pm  /7-base 
Gaussian  doped  BJT. 


Energy  (ev) 


FIGURE  3  Agreement  of  the  electron  distribution  function 
between  SH-BTE  and  MC  for  the  0.05  pm  />-base  Gaussian 
doped  BJT. 


results  are  in  very  good  agreement  for  the  entire 
distribution  function.  We  find  this  to  be  an 
extremely  encouraging  result,  considering  that 
the  SH  method  required  only  10  sec  to  evaluate, 
while  the  MC  approach  took  more  than  10  hours 
on  a  DEC  Alpha  266  MHz  workstation.  The  large 
difference  in  CPU  time  is  attributable  to  the 
deterministic  nature  of  the  SH  approach,  while 
the  MC  method  requires  considerable  time  to 
accumulate  reasonable  statistics  for  the  high 
energy  tail  and  to  surmount  the  large  potential 
barrier  at  the  emitter-base  junction.  (The  MC 
could  probably  be  optimized  but  our  attempts  at 
statistical  enhancement  proved  unreliable  when 
subjected  to  scrutiny). 
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We  explain  the  agreement  between  SH  and  MC 
by  asymptotic  analysis.  First  of  all,  for  very  large 
values  of  SH  index  l,  1+ 1  ~  /,  and  F/+  j  ~  F>. 
Under  these  conditions,  Eq.  (4)  reduce  to  the 
following  relatively  simple  expression. 


v(#  +  #) 


dF„(x,H) 

dx 


Fi(x,H) 
A  (H+q<f>) 


(7) 


where  A (H+q<p)  =  v(H+  q(p)r(H+  qp)  is  a  mean 
free  path,  the  value  of  which  depends  on  energy. 
For  large  energy  A  becomes  approximately  con¬ 
stant,  while  for  lower  energies  A  can  be  approxi¬ 
mated  as  the  following  piecewise  linear  expression: 
A (H+q<(>)  =  a  ■  (H  +  q4>)  +  p  within  [x0,  x]  for  a 
constant  H.  By  integrating  (7)  over  position  space 
from  x0  to  x  for  a  constant  H,  using  the  chain  rule 
dx  =  %de,  as  well  as  approximating  E(x)  as 
constant  in  the  small  interval,  we  can  obtain 


F,(x,H)  =  F,(x0,H) 


A  (H+q<f>) 

A (H  +  qcpo) ) 


which  simplifies  to  the  following  expression  in  the 
high  energy  region  where  A  is  constant. 


F/(x,H )  =  F,(x0,  H )  exp  ^  -  *  A*°  ^  (9) 

In  high  energy,  (9)  shows  that  F\  will  decay 
exponentially  in  x-space  with  a  characteristic 
length  of  A.  Depending  on  the  transport  model 
one  is  using,  A  ranges  from  approximately  2  nm  to 
5  nm  for  energies  greater  than  1.5  eY  in  silicon. 
This  indicates  that  high-energy  spherical  harmo¬ 
nics  decay  very  quickly  in  comparison  with  typical 
dimensions  in  deep  submicron  devices.  For  regions 
of  low  energy  and  low  electric  field,  (8)  ensures 
that  Ft  also  decays  very  rapidly  with  distance. 
(While  this  is  not  immediately  obvious,  it  is  due  to 
the  relationship  between  <j>  and  E,  and  the  fact  that 
A  is  monotonically  decreasing,  ensures  the  expon¬ 


ent  in  (8  becomes  very  large,  and  the  term  in  the 
square  brackets  is  always  less  than  unity).  For  low 
to  moderate  energies,  where  there  is  a  transition 
from  a  low  electric  field  to  a  high  one,  or  visa- 
versa,  the  monotonic  nature  of  A  again  insures  that 
the  square  bracketed  term  in  (8)  will  be  small. 

In  addition  to  the  asymptotic  analysis  for  /, 
physical  boundary  conditions  also  suppress  the 
magnitude  of  high  order  terms.  For  example,  the 
physical  requirement  that  the  distribution  function 
be  single  valued  at  the  energy  e  =  0  (is  not  dependent 
on  angle  at  zero  energy),  leads  to  the  requirement 
that  all  SH  components  with  />0  must  vanish  at 
zero  energy.  Furthermore,  the  requirement  that  the 
distribution  function  is  close  to  Maxwellian  at  the 
device  contacts  implies  that  higher  order  SH  terms 
are  negligible  at  these  boundaries.  Also,  since  in 
silicon  the  scattering  rate  is  fairly  high,  and  since  it  is 
a  covalent  semiconductor,  we  usually  take  the  major 
scattering  rates  to  be  isotropic.  This  leads  to 
scattering  which  tends  to  give  rise  to  a  distribution 
function  which  is  largely  spherical  in  nature. 
Finally,  as  one  examines  the  generalized  SH  system 
given  by  Eqs.  (3)  and  (4),  it  is  clear  that  the  equations 
are  only  coupled  to  nearest  neighbors  in  L  It  is 
therefore  difficult  to  propagate  information  from 
low  order  terms  to  higher  order  ones,  thereby 
minimizing  the  relative  importance  of  higher  order 
SH  coefficients. 
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Photon  recycling  is  examined  as  an  explanation  for  the  observed  large  carrier  lifetimes  in 
an  InP/InGaAs  photodiode.  This  effect  extends  the  effective  carrier  lifetime  within  a 
device  by  re-absorbing  a  fraction  of  the  photons  generated  through  radiative  band-to- 
band  recombination  events.  In  order  to  predict  the  behavior  of  this  carrier  generation, 
photon  recycling  has  been  added  to  our  two-dimensional  macroscopic  device  simulator, 
STEBS-2D.  A  ray- tracing  preprocessing  step  is  used  to  map  all  of  the  possible 
trajectories  and  absorption  of  various  wavelengths  of  emitted  light  from  each 
originating  node  within  the  device.  The  macroscopic  simulator  uses  these  data  to 
determine  the  spatial  location  of  the  re-absorbed  radiation  within  the  geometry  of  the 
device.  By  incorporating  the  ray  tracer  results  with  the  total  quantity  and  spectral 
content  of  recombined  carriers  at  each  node  within  the  simulation,  the  recycled 
generation  rate  can  be  obtained.  A  practical  application  of  this  model  is  presented  where 
the  effects  of  photon  recycling  are  used  as  a  possible  explanation  of  the  discrepancy 
between  the  theoretically  predicted  and  experimentally  observed  radiative  recombina¬ 
tion  rate  in  a  double  heterostructure  photodetector. 


Keywords:  Photon  recycling,  reabsorption,  self-excitation,  macroscopic  simulation 


1.  INTRODUCTION 

It  is  often  observed  that  radiative  band-to-band 

transitions  are  the  dominant  recombination  mec¬ 
hanism  in  many  high  quality  direct  gap  semiconduc¬ 
tors.  Those  carriers  which  recombine  radiatively 
conserve  energy  through  the  liberation  of  a  photon 
with  a  characteristic  energy  near  the  semiconductor 


bandgap.  Photon  recycling  is  the  reabsorption  of 
this  emitted  light  elsewhere  in  the  device  to  produce 
additional  electron-hole  pairs.  Thus,  photon  recy¬ 
cling  can  be  described  as  an  additional  generation 
mechanism  which  is  tightly  coupled  to  the  total 
amount  of  radiative  recombination  over  the  range 
of  the  device.  The  effectiveness  of  photon  recycling 
depends  on  many  factors  such  as  the  dominance  of 
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the  radiative  recombination  mechanism  and  the 
light  confinement  properties  of  the  structure. 

The  influence  of  photon  recycling  on  the  per¬ 
formance  of  many  devices  has  been  investigated  [1  - 
3],  and  in  several  cases  it  has  been  shown  that 
neglecting  this  effect  leads  to  improper  analysis 
of  the  carrier  transport.  Therefore,  a  general  model 
including  the  influence  of  photon  recycling  should 
be  included  within  advanced  macroscopic  device 
simulators.  Unfortunately,  many  of  the  models  used 
to  analyze  photon  recycling  have  relied  on  either 
one-dimensional  or  analytical  approximations  to 
estimate  the  distribution  of  the  generated  carriers, 
and  general  multi-dimensional  models  don’t  exist. 

The  primary  impetus  behind  this  study  is  to 
develop  a  fully  numerical,  general  two-dimensional 
photon  recycling  model  which  can  be  directly 
coupled  into  the  STEBS-2D  [4]  macroscopic  device 
simulator.  To  achieve  the  desired  result,  ray 
tracing  techniques  are  employed  to  monitor  the 
trajectories  and  absorption  of  light  propagating 
through  the  device  geometry.  Since  the  optical 
properties  of  the  devices  of  interest  are  spatially 
inhomogeneous,  and  since  the  generated  light  from, 
the  recombination  events  has  a  non-uniform 
spectral  content,  the  ray  tracer  must  be  used  to 
map  the  absorption  of  light  originating  from  any 
point  in  the  device  over  a  range  of  wavelengths. 

As  an  example  of  the  utility  of  the  model,  an 
InP/InGaAs  double  heterostructure  photodiode 
[5]  is  examined  to  determine  whether  the  photon 
recycling  phenomenon  can  be  used  to  explain  the 
discrepancy  between  theoretically  predicted  and 
experimental  lifetimes.  Experimentally,  the  lifetime 
is  measured  by  monitoring  the  magnitude  of  the 
collected  current  as  a  light  source  is  moved  across 
the  top  of  the  device.  Clearly  from  the  geometry  of 
the  structure  and  the  nature  of  the  experiment, 
one-dimensional  or  analytical  models  are  insuffi¬ 
cient  and  a  two-dimensional  analysis  is  necessary. 

2.  NUMERICAL  MODEL 

The  overall  goal  in  the  implementation  of  photon 
recycling  is  the  mapping  of  the  emission  of 
photons  at  one  point  to  their  absorption  at  all 


other  points  in  the  simulation  domain.  This 
requires  a  discrete  approach  at  the  positional  as 
well  as  the  spectrum  level.  The  spatial  discretiza¬ 
tion  of  the  simulation  domain  follows  that  of  the 
electrical  model,  while  the  continuum  of  emitted 
wavelengths  is  broken  into  several  discrete  energy 
ranges  representative  of  the  wavelengths  which 
could  originate  from  a  given  material. 

Ray  tracing  techniques  are  used  here  to  model 
the  propagation  of  light  through  the  crystal.  A 
general  purpose,  three  dimensional,  ray  tracing 
algorithm  based  on  the  idea  of  constrained 
volumes  is  used  to  provide  the  mapping  of  light 
emission  to  absorption  [6].  The  emitted  rays  are 
tracked  from  volume  to  volume,  and  the  distance 
through  each  is  used  to  determine  the  total 
absorption  at  each  node.  As  the  rays  move 
between  volumes  of  differing  material,  they  are 
refracted  and  secondary  reflected  rays  are  gener¬ 
ated.  Implicit  in  this  model  are  features  such  as 
wavelength  dependent  refraction,  critical  angle 
calculation,  wavelength  and  angle  dependent 
transmission  coefficients,  and  various  surface 
reflection  models  including  antireflection  coatings. 

The  ray  tracing  itself  is  fairly  straight  forward. 
From  each  node,  a  large  number  of  rays  are 
emitted  in  random  directions.  The  path  of  each  ray 
is  followed  until  either  the  ray  is  attenuated  or  is 
coupled  out  of  the  device  without  possibility  of 
return.  For  each  initiating  point,  the  detailed 
absorption  within  the  device  is  catalogued.  This 
process  must  be  repeated  for  each  simulated 
wavelength  owing  to  the  differing  absorption  and 
refraction  indices.  These  data  are  arranged  as  a  set 
of  cross-referenced  lists  relating  the  initiating  point 
of  emission,  the  points  contributing  to  the  absor¬ 
ption,  and  the  amount  of  attenuation. 

The  information  from  the  ray  tracing  is  in¬ 
corporated  into  the  source  term  of  the  current 
continuity  equations  of  the  macroscopic  simulator 
as  seen  in  the  flowchart  given  in  Figure  1.  Within 
each  iteration,  the  spectrum  of  the  spontaneously 
emitted  light  is  obtained  for  each  node,  (x, ;;).  The 
energy  dependent  spontaneous  emission  rate,  R ,  is 
estimated  through  the  use  of  the  van-Roosbroeck 
Shockley  relation  given  by  Eq.  (1) 
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FIGURE  1  Flowchart  depicting  steps  within  the  applied 
photon  recycling  model. 


the  output  from  the  ray  tracer,  the  total  photon 
recycling  generation  rate  at  any  point,  (/,  k )  can  be 
determined  by: 

Gm  =  J2  ^rad(*,.v) 

x,y  V  (W 

(3) 

where  r  is  the  ray  tracer  results  describing  the 
fraction  of  the  absorbed  power  at  (/,  k)  due  to 
emission  at  (, x ,  y )  for  wavelength  A  and  V(x,y)  and 
F(/5  k)  are  the  control  volume  areas  for  the 
originating  and  absorbing  elements. 

The  macroscopic  model  used  in  this  study  is  the 
drift-diffusion  subset  of  the  hydrodynamic  simu¬ 
lator.  Here,  Poisson’s  equation  is  coupled  to  the 
current  continuity  equations  for  inhomogenecus 
materials  [4].  By  including  the  generation  rate 
from  Eq.  (3)  into  the  continuity  equations  and 
iterating  until  a  self  consistent  solution  is  obtained, 
the  effects  from  photon  recycling  may  be  fully 
included  into  the  device  simulation. 


£/< 
L  A 


\x,y,X  )r(x,yJ,k,\) 


R(x,y)(v)dv 


87:  (hv)2n(v)2a(v) 


(1) 


where  A Ef  is  the  current  estimate  of  the  difference 
between  the  electron  and  hole  quasi-Fermi  levels, 
and  a  and  n  are  the  wavelength  dependent 
absorption  and  index  of  refraction  [7].  Next,  an 
estimate  of  the  total  radiative  recombination  rate 
at  the  node  is  determined  by: 


^RAD(x,y)  —  B(n{x.y)P(x,y)  ~  n ])  (2) 


where  the  radiative  constant  B  is  taken  to  be  the 
integral  of  Eq.  (1)  divided  by  «?.  The  normalized 
spectrum  produced  from  Eq.  (1)  is  divided  into 
non-overlapping  energy  ranges  centered  around 
the  discrete  wavelengths  simulated  by  the  ray 
tracer.  The  fraction,  f(x,y,\)>  of  the  overall  spon¬ 
taneous  rate  for  each  wavelength  is  taken  to  be 
the  integral  of  Eq.  (1)  over  each  energy  range  nor¬ 
malized  to  the  integral  of  the  total  spectrum.  Using 


3.  SIMULATION  RESULTS 

The  aforementioned  model,  included  in  the  macro¬ 
scopic  device  simulator,  is  used  to  examine  the 
affect  of  photon  recycling  upon  the  performance  of 
the  device  sketched  in  Figure  2.  For  this  device, 
excess  carriers  are  produced  from  a  1.3  micron 
monochromatic  light  source  ten  microns  in  dia¬ 
meter  moved  laterally  across  the  top  surface.  This 
light  is  absorbed  exclusively  in  the  InGaAs  with  an 
internal  quantum  efficiency  of  97  percent.  For 
the  device  in  question,  the  measured  minority 
carrier  lifetime  is  approximately  three  times 
larger  than  that  predicted  from  the  theoretical 
radiative  recombination  rate.  Photon  recycling  is 
conjectured  here  to  be  a  possible  cause  of  this 
discrepancy. 

The  two-dimensional  device  simulator  cannot 
accurately  capture  the  inherent  three-dimensional 
nature  of  the  diffusion  of  excess  carriers  in  the 
original  experiment.  Therefore  an  assumption 
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FIGURE  2  Illustration  of  device  geometry  and  doping 
profiles  for  the  photodiode  under  investigation.  The  device  is 
nominally  reverse  biased  to  five  volts  and  carrier  excitation  is 
through  photogeneration  from  the  laterally  moving  light 
source.  All  dimensions  are  in  microns  and  doping  in  cm-3. 


must  be  made  upon  the  value  of  the  non-simulated 
third  dimension  of  the  device.  The  calculated 
output  current  is  first  made  to  agree  with  the 
reported  results  using  the  experimentally  obtained 
lifetime  by  adjusting  the  level  of  carrier  injection 
by  an  appropriate  choice  of  length  for  the  third 
dimension.  Once  this  control  is  established,  the 
simulation  is  again  performed  using  the  lower, 
theoretically  predicted,  lifetime.  The  results  of  this 
simulation  are  shown  by  the  dashed  curve  in 
Figure  3  for  two  different  light  intensities.  Com¬ 
parison  of  the  experimental  results  to  the  calcula¬ 
tions  made  neglecting  photon  recycling  with  the 
theoretical  lifetime  clearly  show  that  the  output 
current  is  greatly  underestimated.  The  simulation 
is  also  performed  with  the  inclusion  of  photon 
recycling.  The  curve  marked  with  the  solid  squares 
in  Figure  3  shows  the  calculated  photocurrent 
including  photon  recycling  assuming  the  theore¬ 
tical  carrier  lifetime.  Comparison  of  the  experi¬ 
mental  results  to  the  calculations  including  photon 
recycling  shows  that  the  inclusion  of  the  additional 
photon  recycling  generation  rate  increases  the 
magnitude  of  the  collected  current  to  a  value 
clearly  resembling  that  of  experiment.  From  these 
results,  it  is  observed  that  photon  recycling 
provides  the  mechanism  to  describe  the  disparity 


0  100  200  300  400  500 


Distance  from  light  source  to  depl.  region  (microns) 

FIGURE  3  Calculated  and  experimental  collection  currents 
for  the  double  heterostructure  photodiode  under  2  pW  and  100 
pW  incident  optical  power.  Each  case  illustrates  the  calculated 
current  both  with  and  without  the  inclusion  of  photon 
recycling. 

between  the  experimental  and  theoretical  lifetimes 
observed  in  this  device. 


4.  CONCLUSIONS 

This  paper  describes  a  general  purpose  model  used 
to  include  the  effects  of  photon  recycling  into  a 
macroscopic  device  simulator.  A  three-dimen¬ 
sional  ray  tracer  is  used  to  predict  how  sponta¬ 
neously  emitted  light  propagates  and  is  absorbed 
within  a  semiconductor  device.  These  data  are 
then  coupled  to  a  two-dimensional  macroscopic 
device  simulator  and  are  used  in  conjunction  with 
the  quantity  and  spectral  content  of  the  radiative 
recombination  rate  to  predict  the  total  carrier 
generation  due  to  photon  recycling.  As  an  example 
of  the  model,  the  effect  of  photon  recycling  is 
examined  in  a  double  heterostructure  InP/InGaAs 
photodiode  where  it  is  observed  that  the  inclusion 
of  photon  recycling  within  the  InGaAs  restores  the 


PHOTON  RECYCLING 


157 


agreement  between  the  experimentally  and  theore¬ 
tically  obtained  values  of  carrier  lifetime. 
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The  use  of  a  wavelet  basis  can  lead  to  efficient  methods  for  performing  ab  initio 
electronic  structure  calculations  of  inherently  localized  structures.  In  this  work  wavelets 
are  used  to  construct  an  adaptive  basis  which  is  optimized  dynamically  throughout  the 
calculation.  The  computational  effort  of  such  a  method  should  scale  linearly  with  the 
number  of  basis  functions.  The  adaptive  basis  is  tested  for  the  case  of  bulk  Si  using  only 
a  local  s-pseudopotential. 


Keywords:  Wavelets,  multiresolution  analysis,  electronic  structure,  density  functional  theory, 
adaptive  basis 


INTRODUCTION 

Much  progress  has  been  made  in  the  area  of  ab 
initio  electronic  structure  calculations  using  the 
local  density  approximation  (LDA)  of  density 
functional  theory  (DFT).  Within  DFT  the  total 
energy  is  considered  to  be  a  functional  of  the 
electron  density.  The  electronic  ground  state  is 
found  by  minimizing  this  total  energy,  leading  to 
the  solution  of  the  Kohn-Sham  single-particle 
Schrodinger  equation. 

Despite  its  advantages,  the  commonly  used 
plane  wave  (PW)  basis  possesses  serious  draw¬ 
backs  when  applied  to  localized  systems.  Examples 
of  such  systems  include  those  with  deep  atomic 


pseudopotentials,  e.g.,  oxygen,  or  structures  of  a 
localized  nature,  e.g.,  molecules,  surfaces  and 
interfaces.  In  such  cases  a  prohibitively  high  energy 
cutoff  is  required  to  provide  the  necessary  resolu¬ 
tion.  Due  to  the  delocalized  nature  of  the  PW  basis 
functions,  this  increase  in  resolution  is  applied 
globally  even  though  the  increased  resolution  is 
needed  in  an  essentially  localized  region  of  space. 

As  a  result  of  these  problems  various  alternatives 
to  the  PW  method  have  been  proposed.  Most  can 
be  classified  as  real-space  methods.  These  include 
the  use  of  adaptive  curvilinear  coordinates  [1],  finite 
difference  [2]  and  multigrid  [3]  methods  as  well  as 
wavelet  bases.  It  is  the  use  of  a  wavelet  basis  which 
will  be  the  topic  of  this  paper. 


*Corresponding  author. 


159 


160 


D.  A.  RICHIE  et  al 


The  use  of  a  wavelet  basis  for  electronic 
structure  calculations  was  first  proposed  by  Cho 
et  al  [4]  where  non-orthonormal  wavelets  were 
used.  The  work  of  Wei  and  Chou  [5]  used  the 
compact  orthonormal  wavelets  constructed  by 
Daubechies  [6],  The  same  Daubechies  wavelets 
are  used  in  our  work.  In  addition,  both  Ref.  [4] 
and  Ref.  [5]  used  a  basis  that  was  predetermined 
by  simple  physical  arguments  and  remained  fixed 
throughout  the  calculation.  In  our  work  a  truly 
adaptive  basis  is  used  which  does  not  rely  on  any  a 
priori  assumptions,  but  instead  allows  the  calcula¬ 
tion  to  dynamically  determine  the  optimum  basis 
for  each  electronic  orbital.  Our  work  is  also  unique 
in  that  the  entire  calculation  is  performed  in  a 
wavelet  basis.  This  creates  the  possibility  for  a 
general  method  where  both  the  execution  time  and 
storage  scale  linearly  with  the  number  of  basis 
functions  used  to  represent  each  electronic  orbital. 

WAVELET  BASIS 

A  detailed  discussion  of  wavelets  can  be  found  in 
Ref.  [6].  A  wavelet  basis  forms  a  multiresolution 
analysis  (MRA).  Such  an  analysis  represents  a 
function  as  the  limit  of  increasingly  finer  approx¬ 
imations,  with  each  approximation  being  the  result 
of  smoothing,  or  averaging,  the  function  over 
some  fixed  length  scale.  The  result  is  a  series  of 
embedded  function  spaces  Vm  such  that 

•  •  •  Vm+ 2  C  Vm+ 1  C  Vm  C  Vm~\  C  V m—2  •  •  • 

A  basis  spanning  each  function  space,  or 
approximation  space,  can  be  constructed  from 
the  translations  of  some  localized  function  cj)(x ) 
known  as  the  scaling  function.  Thus,  a  basis  for 
the  approximation  space  Vm  can  be  written 
{<t>mn(x)  =  2m/2<f)(2mx  —  n),  n  e  Z}.  The  corres¬ 
ponding  characteristic  length  scale  is  ~  2~m.  The 
MRA  can  be  seen  to  be  generated  by  this  ‘mother 
scaling  function’  <j)(x).  The  choice  of  the  mother 
scaling  function  is  in  fact  the  beginning  of  the 
MRA  which  generates  a  wavelet  basis. 


Next,  consider  the  projection  of  a  function  onto 
two  successive  approximation  spaces  Vm  and  Vm+ x. 
Information  is  lost  when  going  from  the  finer 
approximation  to  the  coarser  one.  This  leads  to 
the  introduction  of  an  additional  function  space  Wm 
defined  to  be  the  orthogonal  complement  of  Vm  in 
Vm+  u  i-e->  Vm®  Wm=  Vm  +  i  and  Vm  ±  Wm.  This 
function  space,  or  detail  space,  Wm  contains  the 
information  which  is  lost  when  making  the  coarser 
approximation.  Conversely,  the  finer  approxima¬ 
tion  of  a  function  can  be  constructed  from  its 
projection  onto  the  coarser  approximation  space 
plus  its  projection  onto  the  associated  detail  space. 

A  basis  spanning  each  detail  space  can  be 
constructed  from  the  translations  of  some  loca¬ 
lized  function  ^/?(x),  known  as  a  wavelet.  Thus, 
Wm  may  be  spanned  by  {ipmn  =  2m/2ip(2mx-n), 
n  G  Z}.  Here  the  function  ^(x)  is  known  as  the 
‘mother  wavelet’  and  is  directly  related  to  the 
associated  mother  scaling  function  used  to  gen¬ 
erate  the  MRA.  It  is  the  combination  of  scaling 
functions  and  wavelets  which  are  commonly 
referred  to  as  a  wavelet  basis. 

There  exist  a  wide  variety  of  wavelets  con¬ 
structed  to  have  different  properties.  In  this  work 
we  use  two  types  of  wavelets,  specifically  the 
wavelet  known  as  Daubechies-6  (V6)  and  the  Haar 
wavelet.  In  Figure  1  and  Figure  2  the  mother 
scaling  function  and  mother  wavelet  are  shown  for 
X>6-  Both  wavelets  are  compact,  complete  and 
orthonormal.  In  addition,  the  V6  wavelet  has  a 
continuous  first  and  second  derivative. 

The  expansion  of  some  function  f(x)  onto  a 
wavelet  basis  will  therefore  have  the  form  of  an 
expansion  onto  the  translations  of  some  coarse 
scaling  function  plus  the  detail  of  the  expansion 
onto  the  translations  and  dilations  of  the  associ¬ 
ated  wavelet, 

+00  +OQ  +OQ 

/(*)  =  51  fon<fon(x)  +  5Z  51  fmni’nmix).  (1) 
n=— oo  m~ 0  n=— oo 

The  infinite  sums  over  all  translations  can  be 
removed  by  imposing  periodic  boundary  condi¬ 
tions,  which  leads  to  periodized  wavelets.  A  three 


ELECTRONIC  STRUCTURE  CALCULATIONS 


161 


3 

2.5 

2 

1.5 

1 

0.5 

0 

-0.5 

-il - > - « - i - ' - ' - ' - * - * 

-2  -1  0  1  2  3  4  5  6 

FIGURE  1  Real  space  plot  of  the  Daubechies-6  mother 
scaling  function. 


FIGURE  2  Real  space  plot  of  the  Daubechies-6  mother 
wavelet. 


dimensional  basis  can  be  constructed  from  tensor 
products  of  the  one  dimensional  basis. 

One  of  the  key  properties  of  a  wavelet  basis  is  its 
ability  to  represent  the  localized  detail  of  a 
function  using  a  small  number  of  wavelets.  This 
is  a  result  of  the  fact  that  the  wavelet  expansion 
coefficients,  fmn  in  Eq.  (1),  will  decay  rapidly  with 
increasing  m> M ,  where  the  local  smoothness  of 
the  function  can  be  characterized  by  the  length 


scale  ~  2~M .  If  the  magnitude  of  the  expansion 
coefficient  is  less  than  some  threshold  which  is  small 
compared  to  the  norm  of  the  function ,  then  no 
further  detail  is  needed  in  that  region.  This  leads  to 
the  idea  of  a  ‘compressed  basis’,  the  structure  of 
which  is  determined  by  the  above  criteria,  forming 
a  minimal  basis  set  representation  for  a  specific 
function.  Such  a  compressed  basis  will  be  used  to 
represent  the  electronic  orbitals  in  the  electronic 
structure  calculation. 


CALCULATION 

The  electronic  structure  of  the  system  is  deter¬ 
mined  here  using  the  local  density  functional 
theory  neglecting  the  spin.  Therefore  we  determine 
the  electronic  ground  state  by  solving  the  Kohn- 
Sham  equation1 , 

HKSil>i(r)=Ei1>i(r)  (2) 

where  the  electron  density  is  given  by  p(r) 
=  2X^1  |V7/(r)|2  f°r  a  2N-electron  system  since 
each  state  is  doubly-occupied  [7].  Details 

of  this  equation  and  methods  of  solution  can  be 
found  in  Ref.  [8]. 

A  wavelet  based  Laplacian  operator  is  needed 
for  the  kinetic  energy  term  of  the  Kohn-Sham 
Hamiltonian  as  well  as  in  the  solution  of  Poisson’s 
equation.  This  Laplacian  operator  is  performed  by 
matrix  multiplication.  The  elements  of  the  differ¬ 
entiation  matrix  are  pre-calculated  using  the 
method  described  in  Ref.  [9]  The  solution  of 
Poisson’s  equation  in  a  wavelet  basis  is  by  itself  an 
interesting  topic.  The  efficiency  of  the  wavelet  basis 
in  this  problem  is  well  established  [10]. 

The  operation  involving  the  multiplication  of 
the  wave  function  by  the  local  potential  is 
performed  using  the  following  algorithm.  First, 
both  the  wave  function  and  the  local  potential  are 
mapped  from  a  V $  wavelet  representation  to  a 


Atomic  units  are  used  throughout. 


162 


D.  A.  RICHIE  et  al 


Haar  wavelet  representation  using  a  transform. 
Then,  the  two  functions  are  multiplied  in  their  Haar 
representations.  The  product  is  then  mapped  back 
to  a  V 6  wavelet  representation  using  an  inverse  to 
the  transform  mentioned  above.  Thus,  the  multi¬ 
plication  is  performed  entirely  in  a  wavelet  basis. 

The  electronic  ground  state  of  the  system  is 
found  by  solving  Eq.  (2)  iteratively.  The  wavelet 
basis  is  held  fixed  until  convergence  is  achieved. 
The  basis  is  then  compressed,  removing  unnece¬ 
ssary  basis  functions.  Then  basis  functions  are 
added  to  effectively  double  the  resolution  globally. 
This  is  necessary  to  allow  the  basis  to  ‘grow’  to  the 
optimum  structure.  With  this  new  basis,  the  total 
energy  is  again  minimized  and  the  process  is 
repeated. 

RESULTS 

We  have  tested  this  method  by  applying  it  to  a 
supercell  containing  8  Si  atoms  located  at  their 
bulk  positions  and  using  a  local  s-pseudopotential. 
First,  a  simpler  calculation  is  performed  where  the 
multiplication  of  the  electronic  orbital  with  the 
local  potential  is  performed  by  first  mapping  both 
functions  to  real  space  via  a  fast  wavelet  transform, 
multiplying  the  two  functions  and  then  mapping 
back  to  wavelet  space.  This  procedure  is  similiar  to 
that  used  in  Ref.  [5].  In  this  calculation  there  is  no 
compression  since  the  full  cubic  grid  must  be  used. 
This  “full  grid”  method  is  performed  for  increasing 
grid  sizes  of  43, 83, 163  and  323.  The  results  are 
shown  in  Figure  3.  The  correct  degeneracies  (1-6- 
6-3)  of  the  16  eigenvalues  are  achieved  consistent 
with  the  symmetry  of  the  problem. 

Next,  the  same  calculation  is  performed  with  an 
adaptive  wavelet  basis.  Here,  the  multiplication  is 
performed  using  the  algorithm,  described  above, 
where  the  entire  calculation  is  performed  in  a 
wavelet  basis.  The  correct  degeneracies  are  again 
achieved  to  within  10-4  a.u.  The  adaptive  basis 
converges  to  the  correct  eigenvalues.  For  the  three 
lowest  levels  the  convergence  requires  less  basis 
functions. 


FIGURE  3  Eigenvalues  of  the  Kohn-Sham  Hamiltonian  for  a 
supercell  containing  8  Si  atoms.  A  local  s-pseudopotential  is 
used.  The  eigenvalues  are  plotted  vs.  (basis  size)1/3.  Compari- 
sion  is  between  the  adaptive  and  non-adaptive  calculations 
indicated  by  the  solid  and  dashed  lines,  respectively. 


CONCLUSION 

We  have  demonstrated  that  it  is  possible  to 
minimize  the  total  energy  in  an  electronic  structure 
calculation  while  simultaneously  optimizing  the 
wavelet  basis  set  used  to  represent  each  electronic 
orbital.  This  general  method  should  lead  to 
efficient,  potentially  linearly  scaling  electronic 
structure  calculations.  Furthermore,  if  the  atoms 
are  allowed  to  move  as  in  a  molecular  dynamic 
simulation,  the  adaptive  basis  should  adjust  to  the 
needs  of  the  structure.  Presently,  the  various 
corrections  resulting  from  exchange-correlation 
and  non-local  effects  need  to  be  implemented  to 
perform  calculations  precise  enough  to  compare 
with  other  more  established  methods.  The  advan¬ 
tages  of  an  adaptive  wavelet  basis  should  lead  to 
efficient  methods  when  applied  to  larger,  localized 
structures. 
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Quantum  transport  is  becoming  more  significant  as  device  size  shrinks.  For  example,  as 
device  sizes  are  scaled  below  0.1  pm,  the  number  of  impurities  becomes  quite  small,  so 
that  they  are  no  longer  homogeneously  distributed  throughout  the  device  volume  and 
the  carriers  are  localized  into  quantum  boxes ,  in  that  self-energy  corrections  produced  by 
locally  high  carrier  densities  will  lead  to  quantum  dot  formation.  This  leads  to  the  need 
to  discuss  transport  through  an  array  of  such  quantum  structures .  Here,  we  discuss 
several  issues  which  must  be  considered  in  treating  the  transport  through  such  devices. 


Keywords :  Quantum  transport,  device  modeling,  impurities 


1.  INTRODUCTION 

The  demand  for  enhanced  performance  in  VLSI 
circuits  pushes  semiconductor  devices  to  miniatur¬ 
ization  limits,  at  which  point  leading  edge  devices 
will  employ  0.07  pm  gate  lengths.  There  is  evidence 
that  we  can  go  beyond  this  size,  and  many 
laboratories  have  produced  experimental  devices 
in  the  20-30  nm  gate  length  range.  Thus,  one  can 

approach  sizes  that  are  only  100  x  the  atomic 

spacing  in  the  semiconductor.  We  must  treat 
transport  in  a  quantum  mechanical  manner, 
though  we  would  like  to  extrapolate  from  present 
classical  approaches,  and  retain  the  appealing 
framework  of  the  latter  formalism. 


One  major  problem  is  caused  by  the  impurity 
atoms.  For  these  devices,  the  active  region  will 
contain  so  few  dopant  atoms  that  fluctuations  in 
this  number  will  affect  device  performance.  The 
impurity  profiles  determine  the  device  configura¬ 
tion,  and  most  assumptions  presume  that  the 
impurities  are  randomly  dispersed  with  a  uniform 
doping  profile.  However,  the  fluctuation  in  the 
number  of  total  particles  in  the  channel  for  small 
devices  is  comparable  to  the  actual  number  itself. 
In  addition,  it  can  be  expected  that  the  devices  will 
exhibit  noticeable  conductance  variations.  It  is 
clear  that  the  carrier  density  in  the  channel  is  very 
inhomogeneous  and  this  will  lead  to  carrier 
localization  in  regions  approximately  10-20  nm 
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on  a  side,  because  the  exchange  energy  leads  to 
band-gap  narrowing  in  this  region.  Consequently, 
these  carriers  sit  in  small  3D  quantum  boxes 
containing  10-20  electrons  per  box  [1].  Yet  most 
approaches  to  classical  device  simulation  do  not 
adequately  treat  this  fluctuation  in  the  carrier 
density  and  doping,  nor  indeed  the  strong  Cou¬ 
lomb  correlations. 

Several  issues  must  be  considered  in  treating  the 
transport  through  such  devices,  among  which  are: 

(1)  Phase  coherence  within  the  array  of  quantum 
boxes; 

(2)  The  transition  between  semi-classical  and  fully 
quantum  transport,  including  the  ability  or 
inability  of  the  present  heuristic  methods 
based  on  a  quantum  potential  to  properly 
describe  charge  bunching  and  tunneling; 

(3)  The  role  of  the  contacts,  vis-a-vis  the  fabri¬ 
cated  boundaries,  and  the  actual  versus  inter¬ 
nal  boundaries; 

(4)  Electron  correlation  due  to  the  small  capaci¬ 
tances  between  such  dots,  and  the  possibility 
of  single-electron  effects  in  fluctuations  of  the 
overall  device; 

(5)  The  possible  need  to  provide  decoherence  at 
the  source  or  drain. 

Here,  we  address  some  of  these  problems.  To 
begin,  we  discuss  a  general  quantum  formulation, 
as  well  as  providing  an  approach  that  will  allow 
studies  of  the  above  questions.  We  discuss  a 
prototypical  quantum  dot,  and  its  experimental 
behavior,  before  considering  the  quantum  trajec¬ 
tories  and  possible  extension  of  device  descriptions 
such  as  the  quantum  hydrodynamic  approach  and 
the  quantum  potential. 

2.  COUPLING  THE  DEVICE 

TO  THE  SYSTEM 

Let  us  consider  an  isolated  quantum  device  (e.g., 
one  quantum  dot  in  a  real  transistor),  which  is 
embedded  in  its  environment.  The  principles  are 
described  by  the  isolated  device,  its  input  and 
output  contacts,  and  the  coupling  between  these 


parts.  The  device  is  described  by  its  Hamiltonian 
Hd  and  wave  function  tyd,  while  the  input  and 
output  contacts  are  described  by  the  Hamiltonians 
Hu  and  Hy  and  wave  functions  and  SS?y, 
respectively.  The  Hilbert  space  for  the  total  wave 
function  is  a  summation  space,  rather  than  a 
tensor  product  space  which  is  often  used  to 
describe  system-environment  interactions.  That 
is,  there  is  a  complete  Hilbert  space  for  each  of 
the  wave  functions,  and  the  total  space  is  the 
summation  of  these  three  spaces.  This  choice  of 
space  allows  us  to  talk  about  wave  functions 
localized  in  the  device,  or  in  the  contact,  or 
hybridized  between  them  using  wave  functions 
isolated  in  one  part  of  the  overall  Hilbert  space.  It 
is  assumed  that  the  Hamiltonian  remains  separable 
into  parts  representing  each  space  plus  coupling 
terms.  Hence,  tyd,  \I>W  and  fyy  represent  members  of 
the  complete  Hilbert  spaces  for  those  isolated  parts 
of  the  total  system.  From  some  simple  manipula¬ 
tions,  we  can  arrive  at  the  transmission  matrix 
from  a  pure  input  state  to  a  pure  output  state 

t  =  {Hd  +  AH-XR)~XCH-XA.  (1) 

Here,  A ,  C  and  R  are  the  coupling  matrices  from 
the  input  contact  to  the  device,  from  the  device 
to  the  output  contact,  and  from  the  device  back  to 
the  input  contact,  respectively.  There  are  at  least 
three  different  cases  of  interest  [1]. 

The  case  that  is  most  relevant  to  the  present 
discussion  is  particular  to  quantum  dots  in  which 
the  contacts  are  open  quantum  point  contacts  and 
we  cannot  ignore  AH~lR.  The  nature  of  these 
quantum  point  contacts  is  that  the  modes  in  the 
contact  region  are  eigenmodes  of  the  effective 
waveguide,  and  this  produces  significant  collima- 
tion  of  the  entering  particle  beam  [2].  There  is  then 
a  significantly  different  behavior  in  the  dot,  and 
the  response  is  governed  by  the  semiclassical 
trajectories  that  are  excited  in  the  dot  by  the 
waveguide  modes.  This  behavior  is  clearly  seen  in 
both  square  dots  [3]  and  in  stadium  (oval)  dots 
that  are  classically  chaotic  [4,  5],  as  may  be  seen  in 
Figure  1.  In  these  dots,  in  which  only  a  small 
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FIGURE  1  Collimation  of  the  input  electrons  by  the  quantum  point  contact  leads  to  the  excitation  of  heavily  scarred  wave 
functions,  representative  of  the  regular  trajectories  in  the  ballistic  quantum  dots.  Typical  scars  are  shown  for  a  square  dot  (a)  and  an 
oval,  or  stadium,  dot. 


family  of  trajectories  is  excited  by  the  collimated 
beam,  the  spectral  density  is  still  evolving  from  a 
series  of  delta  functions,  and  the  regular  paths  give 
oscillatory  contributions  to  the  spectral  density. 
Indeed,  measurements  of  the  phase  breaking  time 
have  clearly  shown  the  2D  to  OD  transition  in  the 
dot  as  the  temperature  is  lowered  (below  130  mK 
in  a  1.0  micron  dot)  [6].  The  periodic  orbits 
combine  coherently  to  produce  much  stronger 
oscillations  than  those  of  an  isolated  orbit  of  a 
chaotic  system.  This  is  important,  as  the  coherence 
of  the  phase  integrals  leads  to  quite  long  and 
periodic  orbits.  It  is  the  periodic  nature  of  these 
long  orbits,  which  contributes  a  large  number  of 
terms  to  the  spectral  density,  corresponding  to 
returns  to  the  input  contact  after  different  numbers 
of  traversals.  The  interference  among  these  waves 
of  successive  circuits  gives  rise  to  the  set  of 
quantized  levels  that  appear  in  the  dot. 

The  role  of  the  contacts  can  also  be  seen  in  the 
actual  phase-breaking  time  measured  in  the  dots. 
In  Figure  2,  we  show  measurements  on  a  set  of 
nominally  square  dots  [7].  Here,  the  phase-break¬ 
ing  time  is  found  to  be  relatively  independent  of 
the  dot  area,  but  does  depend  upon  the  opening  of 
the  QPCs.  If  the  overall  resistance  is  less  than  12 
kft,  the  phase-breaking  time  is  about  30  ps.  On  the 
other  hand,  if  the  overall  resistance  is  greater  than 


15  kft,  the  phase-breaking  time  increases  by  an 
order  of  magnitude.  We  interpret  this  as  an 
additional  phase-breaking  process,  external  to 
the  dot,  that  is  coupled  to  the  dot  by  a  sufficient 
opening  of  the  QPCs. 

What  we  have  found  in  the  above  discussion 
correlates  well  to  what  we  know  from  our  under- 


FIGURE  2  The  phase  breaking  time  measured  experimen¬ 
tally  for  three  different  sizes  of  ballistic  quantum  dot.  The 
important  aspect  is  that  each  shows  a  transition  to  a  much 
smaller  value  of  the  phase  breaking  time  as  the  quantum  point 
contacts  are  opened  (smaller  resistance)  so  that  the  dot  is  more 
strongly  coupled  to  the  environment. 
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standing  of  classical  device  simulation:  The  nature 
of  the  contacts  can  dramatically  affect,  and  even 
dictate,  device  performance.  This  point  is  often 
missed  in  discussions  of  quantum  transport  which 
are  derived  from  equilibrium  quantum  statistical 
mechanics.  The  role  of  the  boundaries  cannot  be 
overlooked,  and  usually  is  the  dominant  control 
mechanism  in  device  performance.  Without  proper 
treatment  of  these  boundary  conditions,  no  ade¬ 
quate  quantum  device  simulation  can  be  attained. 

3.  TRAJECTORIES,  PATHS 
AND  POTENTIALS 

Let  us  now  return  to  the  question  of  the  DOS,  as 
measured  by  the  transport  in  the  quantum  dot.  In 
square  quantum  dots,  we  have  found  from 
comparison  between  theory  and  experiment  that 
the  transport  behavior  is  generally  regular.  By 
regular,  it  is  meant  that  the  quasi-ballistic  motion 
scatters  elastically  off  the  walls  of  the  dot.  In  the 
fully  quantum  limit  of  a  zero-dimensional  dot,  the 
density  of  states  (at  T  =  0)  is  simply  a  sum  of  delta 
functions,  located  at  the  energy  levels  of  the  dot, 
although  the  DOS  must  change  to  the  uniform 
two-dimensional  form  as  these  delta  functions  are 
broadened.  For  a  classically  chaotic  system,  it  has 
been  thought  that  opening  the  quantum  billiard 
(to  interactions  with  the  outside  world)  will 
introduce  chaotic  behavior  in  the  level  statistics 
and  in  the  transport.  Only  recently  have  the 
measurements  of  magnetotransport  in  fully  quan¬ 
tized  dots  (A  ^>kBT)  appeared  that  show  the 
residual  classically  regular  transport ,  ruling  out 
the  normally  accepted  chaos  theory. 

For  regular  motion,  only  one  family  of  periods 
may  be  observed,  but  this  family  will  be  composed 
of  the  fundamental  and  several  harmonics,  as  seen 
in  the  experimental  results.  We  are  led  to  the 
assertion  that  the  study  of  transport  in  “regular” 
billiards  exposes  the  fundamental  quantum  structure 
of  the  dots  themselves,  and  provides  important 
information  as  to  how  the  transport  will  exhibit 
itself  in  future  electron  devices. 


The  understanding  of  how  classical  flows  evolve 
into  quantum  mechanical  currents  is  still  an  open 
issue.  There  has  been  considerable  controversy 
over  the  description  of  phase  space  trajectories  in 
quantum  mechanics.  The  incorporation  of  a 
quantum  potential  is  supposed  to  provide  quantum 
mechanical  deviations  from  the  classical  orbits. 
The  most  common  approach  to  the  quantum 
potential  is  through  the  decomposition  of  the  wave 
function  as 

t)  =  Vp(rJ)  exp  (iS(r,t)/h),  (2) 

where  p(r,  /)  is  the  density  (p  also  will  be  used  as 
the  density  matrix  later,  but  this  is  not  its  meaning 
here),  r  and  t  are  the  space-time  manifold,  and 
S(r,  t)  h  is  the  phase  associated  with  the  motion. 

An  unambiguous  construction  of  quantum 
trajectories,  associated  with  pure  states  (in  both 
configuration  space  and  phase  space)  is  provided 
by  the  Bohm-de  Broglie  interpretation  of  quantum 
mechanics  [8,  9].  Here,  the  Schrodinger  equation 
for  a  carrier  describes  the  deterministic,  causal 
evolution  of  a  complex  field,  but  is  augmented  by 
the  postulate  that  the  carrier  may  be  considered  as 
a  classical  point  particle  with  position  vector  *(/) 
and  momentum  vector  p(t)  embedded  within  the 
pilot  field  ^(r,  t ),  given  by  (2).  At  any  time  t0 ,  the 
carrier  is  found  with  probability  density  p(r,  t0)  to 
have  position  x(t0)  =  r  and  p(t0)  =  VS(r,  /0). 
Thereafter,  the  particle  trajectory  in  configuration 
space  and  phase  space  is  followed  deterministically 
under  the  influence  of  the  pilot  field  via  the 
parameterization 

x{t)  =  *(*<>)+  [  p{t')dt', 

Jt0  (3) 

p(t)  =  VS[r(/),/]. 

This  picture  is  not  a  transformation  to  a  particle 
picture  which  can  replace  the  quantum  description 
of  the  system,  but  instead  provides  a  description  in 
a  kinematical  sense  of  the  behavior  of  the  system. 
The  evolution  of  S ,  and  hence  p,  is  provided  by  the 
Schrodinger  equation,  which  is  a  separate  field 
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theory.  Classically,  the  action  S  is  determined 
completely  by  the  underlying  particle  dynamics  in 
the  system,  viz.  the  Hamilton-Jacobi  equations.  The 
quantum  “action”  S  is  also  claimed  to  be  deter¬ 
mined  by  a  Hamilton-Jacobi-like  equation,  which 
differs  from  the  classical  version  by  coupling  to  the 
continuity  equation  through  the  quantum  potential 
Vq.  The  latter  determines  the  probability  density  of 
locating  the  particle  within  quantum  theory,  as 
noted  previously,  and  satisfies  the  continuity 
equation.  However,  S  is  not  determined  entirely 
by  the  local  dynamics,  but  must  satisfy  a  crucial 
topological  condition  which  follows  from  its  origin 
as  the  phase  of  an  independent  complex  field. 

One  must  be  sure  to  properly  incorporate  the 
single-valuedness  of  the  wave  function,  otherwise 
the  equation  of  motion  for  S  will  not  be  properly 
constructed.  In  particular,  we  note  that  S  is 
defined  modulo  h  (Planck’s  constant).  Since,  the 
velocity  is  normally  described  from  v  =  VS/m ,  the 
periodic  nature  of  S  leads  to  the  quantization 

j)  v  •  dr  =  Ih/m ,  (4) 

where  /  is  an  integer.  In  classical  hydrodynamics, 
the  integral  on  the  left  is  known  as  the  velocity 
circulation,  and  is  related  through  Stoke’s  theorem 
to  the  vorticity  of  the  motion.  Most  approaches  to 
the  use  of  dynamical  equations  ignore  the  quanti¬ 
zation  within  the  system.  The  quantization  must 
be  put  in  quite  early  in  the  process,  as  it  will  not 
evolve  from  a  classical  distribution  function. 

If  we  consider  any  closed  contour  for  which  (4) 
holds,  we  can  write  at  any  point  S'  -  S+Nh.  Let 
us  suppose  that  N^O  and  that  somewhere  within 
the  area  enclosed  by  the  contour,  S  goes  to  zero. 
This  must  occur  where  ^fp  =  0,  i.e.,  at  the  nodes 
which  are  required  to  preserve  the  continuity  of 
the  wave  function  itself.  At  these  points,  both  the 
real  and  the  imaginary  parts  of  the  wave  function 
vanish,  and  this  describes  the  nodal  lines  in  space. 
In  addition  to  the  quantized  orbits  (which  can  be 
extended  to  any  multiply-connected  region),  the 
nodal  lines  are  regions  in  which  the  motion  is  no 
longer  irrotational. 


It  is  important  to  point  out,  that  closed  orbits  in 
a  ultrasmall  device  can  be  induced  not  only  by 
reflections  between  two  barriers  (the  resonant  level 
in  the  double-barrier  tunneling  device),  but  also  by 
multiple  scattering  from  impurities  (the  source  of 
weak  localization  and  universal  conductance 
fluctuations),  or  from  any  combination  of  these 
factors.  The  latter  is  important  for  universal 
conductance  fluctuations  and,  indeed,  has  been 
suggested  for  the  observation  of  weak  localization 
in  a  single  quantum  point  contact. 

One  can  make  a  minimalist  improvement  on  the 
Bohm-deBroglie  theory,  which  is  normally  descri¬ 
bed  by  [10] 

(6(x(t)  -  r)>  =  p(r,  0,  Po  =  VS(r,  0-  (5) 

We  assume  that  x (t)  is  a  stochastic  variable, 
such  that  the  mean  momentum  is  given  by  the 
gradient  of  the  phase  as  in  (5): 

(p)  =  J  d3xPVS(r,  t).  (6) 

To  account  for  the  stochastic  nature,  we 
introduce  a  momentum  fluctuation,  which  is 
defined  through  ppx  =  tNp/ 2.  Then,  we  find  a 
new  result  for  the  mean  square  momentum  as 


(p2)  =  J  <Px{p[pl+p^}.  (7) 


This  is  the  true  quantum  mechanical  expectation 
value,  as  compared  to  the  Bohm  picture,  which 
omits  the  term  in p\.  As  noted  in  [1 1],  this  omission 
leads  to  a  violation  of  Heisenberg’s  uncertainty 
relation.  In  this  extended  picture,  the  equations  for 
continuity  and  momentum  are  given  by 


as 

dt 


pI+p\ 

2m 


+  V  — 


n 

Imp 


v  •  (ppi), 


t+v-P)-  0. 

at  \  m  J 


(8) 

(9) 


The  traditional  momentum  equation  is  now 
found  by  taking  the  divergence  of  (8),  but  it  is  clear 
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that  the  quantum  potential  now  splits  into  two 
components 


n 

2m  p 


v  •  (ppi), 


(10) 


where  the  leading  the  term  is  related  to  the  kinetic 
energy  associated  with  the  quantum  fluctuations. 
The  second  term  in  (10)  involves  the  divergence  of 
the  fluctuation  current  and  its  spatial  average  over 
the  position  probability  density  vanishes.  This  is  a 
crucial  point,  and  restores  the  inherent  statistical 
nature  of  quantum  mechanics  to  the  apparent 
deterministic  form  of  the  Bohm  picture.  We  find 
that  pi  satisfies  identically  a  stationary  Fokker- 
Planck  equation 


reinforcing  our  view  that  p\  describes  the  fluctua¬ 
tions.  The  above  equations  are  an  alternative  and 
more  acceptable  semantic  deconstruction  [12]  of 
the  Schrodinger  equation  (which  must  be  augmen¬ 
ted  by  the  phase  boundary  condition  (4)  as 
before),  and  points  out  that  there  are  many 
different  definitions  of  a  “quantum  potential”. 


4.  PROBLEMS  OF  THE  QUANTUM 
POTENTIAL 

Over  and  above  the  above  problems  of  the 
uniqueness  of  a  quantum  potential,  the  more 
normally  used  one  has  its  own  problems  with 
any  sort  of  fluctuations.  Using  the  wave  function 
(2)  for  a  pure  state,  we  may  easily  write  two 
continuity  equations  for  the  probability  density 


where 


VQ 


t?  v\/p 

2m  y/p 


(14) 


is  the  more  usual  quantum  potential.  Taking  the 
gradient  of  (13)  produces  an  Euler  equation  which 
includes  the  additional  force  arising  from  the 
quantum  potential,  and  this  has  been  the  justifica¬ 
tion  for  much  of  the  usage  of  the  heuristic 
quantum  hydrodynamic  equations  for  mixed  state 
systems.  One  must  be  careful  to  note  the  multi¬ 
valued  nature  of  5,  quantization  of  the  orbits,  and 
the  boundary  conditions,  scattering,  and  the 
continuity  of  the  wave  function. 

To  extend  the  above  approach  to  a  mixed  state, 
one  can  use  the  same  equations  above,  but  this 
incorporates  errors  and  approximations.  Instead, 
let  us  write  the  density  matrix  as 


N 


p(ri  r  1  0  ”  v  Pnl'tpn)  ($n\  —  ^  ^  P npm 

n  =  X  - 1  (15) 


N 


= 


m 


n=\ 


where,  for  generality,  we  do  not  specify  any 
functional  form  a  priori  for  the  probabilities  Pn 
for  the  states  We  may  now  introduce  an 
average  velocity  through  the  prescription 

N 

pv*  =  £  =  PnpnVn,  (16) 

n=  1 


for  which  the  continuity  Eq.  (12)  is  recovered  for 
the  average  velocity  rather  than  that  of  any  pure 
state.  We  find  that  a  single  quantum  potential  does 
not  immediately  appear  in  the  problem,  and 


^  +  V-(pv)  =  0, 


and  for  the  action 


dS  (VS)2 
dt  2m 


+  v+vQ  =  0, 


(12) 


dva 

m—  +  m 


N 

£  P„[v„V  •  (p„vn)  +  pn(vn  ■  V)v„] 


1 


-  VaV  •  (pva) 


N 

=  _VF-£/V„VFe„. 

n=l 

(17) 


(13) 
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Clearly,  the  “energy”  term  is  quite  complex,  and 
not  simply  related  to  a  single  temperature ,  and 
even  the  diagonal  terms  contain  significant  fluc¬ 
tuations  beyond  those  related  to  the  velocity 
fluctuations.  The  quantum  potential  is  not  that  of 
the  pure  state .  Suppose  we  have  a  small  fluctuation 
around  some  steady  value,  in  which  we  can  define 
the  parameters  such  that 


the  last  section.  It  is  this  second  term  that  inspires 
the  introduction  of  px  in  the  previous  section. 

Finally,  let  us  return  to  the  crucial  topological 
condition  on  the  phase  of  individual  quantum 
states,  as  expressed  through  condition  (4).  This 
condition  achieves  enormous  complexity  for  mixed 
state  descriptions,  and  leads  to  a  condition  on  the 
average  velocity  which  takes  the  form 


N 


Pn  =  P  +  A  pn  (...)  =  P»  >  P  =  (Pit)- 

(18) 


n=  1 


In  this  case,  the  leading  correction  term  is  of 
second  order,  but  we  can  write  the  new  quantum 
potential  as 

N 

^PnPn^VQN  — ►  pVFg(l  —  a(ApnApn))  +  . . . 

n=  1 

(19) 


The  term  in  the  angular  brackets  represents  a 
significant  contribution  to  the  overall  density- 
density  correlation  function,  which  is  known  to 
produce  weak  localization  and  universal  conduc¬ 
tance  fluctuations  in  mesoscopic  devices.  Hence, 
we  see  that  there  is  a  modification  of  the  quantum 
potential  in  the  mixed  state  due  to  these  fluctua¬ 
tions,  which  are  readily  induced  by  scattering 
processes.  We  can  understand  the  source  of  the 
correction  term  by  rewriting  (14)  as 


Vq 


n2  vVp 

2m  yfp 


h2 

\2P 

(vpY 

2m 

. 2p 

\2p) 

(20) 


The  second  term  is  directly  related  to  the  zero- 
point  fluctuations,  and  it  is  clear  that  the 
additional  fluctuations  introduced  by  scattering, 
boundaries,  etc.,  will  produce  a  major  contribution 
to  the  density -density  correlation  function 
through  this  term.  It  is  the  corrections  arising 
from  this  term  that  dominantly  lead  to  (19)  and, 
indeed,  it  is  the  properties  of  the  second  term  that 
are  important  in  introducing  the  fluctuations  of 


(2i) 

subject  to  the  N  conditions 

j)  vn-  dr  =  lnh/m.  (22) 

Because  the  quantum  numbers  ln  are  positive  or 
negative  integers,  there  is  the  possibility  that  the 
circulation  Ta ,  which  is  a  superposition  of  terms 
involving  the  /„,  will  “phase  mix”  to  zero.  If  this 
were  exact,  the  quantum  hydrodynamic  equations 
would  indeed  be  simple  extensions  of  classical 
hydrodynamic  theory  apart  from  the  quantum 
potential  terms.  However,  to  our  knowledge  this 
point  has  never  been  explored. 


5.  DISCUSSION 

Why  has  our  emphasis  been  on  the  concept  of 
defining  a  quantum  kinetic  (trajectory  based) 
picture,  or  even  that  of  a  quantum  hydrodynamic 
(streamlines  based)  picture?  As  discussed  above, 
experiments  are  now  available  on  ballistic  quan¬ 
tum  dots  in  which  one  can  study  both  the  quantum 
transport  and  the  classical  ballistic  transport  in  a 
manner  that  good  agreement  is  achieved  between 
the  two  and  with  experiment.  Moreover,  hydro- 
dynamic  approaches  have  been  extensively  pur¬ 
sued  for  “quantum”  simulation  of  small  devices. 
Here,  however,  we  have  demonstrated  significant 
limitations  and  problems  in  the  latter. 

Our  motivation  is  to  find  a  true  quantum 
kinetic,  or  quantum  hydrodynamic,  picture  that 
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properly  allows  for  the  fluctuations,  and  recog¬ 
nizes  the  particular  problems  which  derive  from 
the  explicit  non-locality  of  the  problem  and 
properly  accounts  for  the  topological  phase 
boundary  conditions.  The  success  of  Monte  Carlo 
suggests  that  one  might  analogously  find  a  Monte 
Carlo,  trajectory-based,  non-deterministic  picture 
for  the  Schrodinger  equation  and/or  the  density- 
matrix  Liouville  equation. 

To  this  end,  we  can  return  to  the  ballistic 
quantum  dots,  as  the  regular  trajectories  that  exist 
in  these  structures  provide  an  ideal  testbed  for 
studying  the  evolution  of  the  classical  trajectories 
into  the  quantum  kinetic  picture.  Moreover,  these 
structures  are  explicitly  dependent  upon  the  details 
of  the  confining  boundary  potentials  and  the  initial 
conditions  defined  by  the  contacts.  They  are  an 
ideal  structure  to  study  quantum  kinetics  without 
the  important  quantum  effects  being  masked  by  an 
overlaying  role  of  inelastic  processes,  which 
ultimately  destroy  much  of  the  quantization. 
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It  is  argued  that  the  Pauli  master  equation  can  be  used  to  simulate  electron  transport  in 
very  small  electronic  devices  under  steady-state  conditions.  Written  in  a  basis  of  suitable 
wavefunctions  and  with  the  appropriate  open  boundary  conditions,  this  equation 
removes  some  of  the  approximations  which  render  the  Boltzmann  equation 
unsatisfactory  at  small  length-scales.  The  main  problems  consist  in  describing  the 
interaction  of  the  system  with  the  reservoirs  and  in  assessing  the  range  of  validity  of  the 
equation:  Only  devices  smaller  than  the  size  of  the  electron  wavepackets  injected  from 
the  contacts  can  be  handled.  Two  one-dimensional  examples  solved  by  a  simple  Monte 
Carlo  technique  are  presented. 


Keywords:  Master  equation,  electron  transport,  small  devices 


1.  INTRODUCTION 

As  noted  by  Frensley  discussing  quantum  trans¬ 
port  in  open  systems  [1],  the  Pauli  master  equation 
[2],  (PME)  could  constitute  an  intuitive  description 
of  electron  transport  in  semiconductor  devices  of 
size  comparable  to  the  electron  wavelength:  One 
could  capture  the  wavelike  nature  of  transport, 
lost  in  the  Boltzmann-transport-equation  (BTE) 
picture,  so  bypassing  the  weak-field,  long  devices 
limitations.  In  retaining  the  weak- scattering  and 
completed-collision  limits  one  would  not  reach  a 
full-fledged  quantum  description,  as  required  to 
study  very  fast  time  transients  [3,  4].  Yet,  as 
devices  approach  the  sub-50  nm  length-scale,  the 


advantage  of  avoiding  the  concept  of  point-like 
electrons  appears  overwhelming. 

Although  arguments  have  been  raised  against 
the  correctness  of  the  PME  [1],  it  is  applicable  to 
very  small  devices  and  to  steady-state  phenomena: 
Small  devices ,  because  the  PME  rests  on  the 
absence  of  off-diagonal  elements  of  the  density 
matrix  p.  In  turn,  their  absence  implies  that  the 
contacts  inject  ‘plane  waves’,  which  corresponds 
physically  to  the  injection  of  spatially  highly 
delocalized  wavepackets.  Hence,  the  device  must 
be  smaller  than  the  ‘dephasing  length’  in  the 
contacts  [5].  For  heavily  doped  Si,  this  is  of  the 
order  of  50  nm,  which  is  of  technological  interest. 
Steady  state ,  since  any  nontrivial  time  transient 
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(e.g.,  relaxation  of  photo-excited  carriers,  fast 
switching  of  a  device)  would  excite  those  off- 
diagonal  terms  of  p  which  the  PME  cannot  handle. 
Indeed,  as  also  note  by  Frensley  [1],  the  PME  is 
inconsistent  with  current  continuity  in  this  case. 


2.  MASTER  EQUATION  AND  CONTACTS 

Let’s  consider  a  semiconductor  region  unbounded 
in  the  (x,  y)-plane,  homogeneous  in  this  plane,  in 
contact  with  two  reservoirs  at  z = 0  and  z=L.  Let’s 
write  the  single-particle  Hamiltonian  as: 


/i2V2 

2m* 


eV(z)  -f-  f/int  -f-  Hies 


—  Ho  !  f Ant  -|-  -fires) 


(1) 


within  the  envelope/effective-mass  approximation, 
m*  being  the  effective  mass.  Hint  describes  the 
electron-phonon  interaction  or  interactions  with 
impurities,  etc.  1 Htes  represents  the  interaction 
between  the  device  and  the  external  reservoirs. 
The  time  evolution  of  the  system  is  given  by  the 
‘transport’  equation  for  the  ‘reduced’  (electronic) 
density  matrix  p: 


dp 

dt 


=  l\P,H]  + 


(2) 


The  last  term  on  the  right-hand-side  describes  the 
effect  of  the  reservoirs.  Assuming  HinX  =  a  H',  we 
can  derive  a  transport  equation  from  Eq.  (2),  by 
considering  only  terms  to  the  leading  order  in  a, 
and  following  the  ‘standard’  derivation  of  Kohn 
and  Luttinger  [7],  but  now  using  as  basis  functions 
not  plane  waves,  but  the  eigenstates  \p,K)  of  H0, 
such  that  H0\pK)  =  [Efi  + h2K2/(2m*)]\pK)  = 
E„(K)\fj,K).  This  simplifyies  Eq.  (2),  since  the 
commutator  now  contains  only  the  interaction 
term  Hint.  The  wave-functions  are  simply  of  the 
form  (M(z)  exp  (iK-  R)/(2n),  where  K  and  R  are  the 
wavevector  and  position  on  the  (x,  j)-plane,  and 
the  functions  £M(z)  obey  the  one-dimensional 


Schrodinger  equation  for  the  potential  V(z), 
subject  to  the  boundary  conditions  F(0)  =  VL  and 
V  (L)=  V R.  Taking  —eVL>  -eVR,  for  any  given 
energy  Efl>  ~eVL,  two  independent  solutions 
represent  waves  incident  from  the  left  (  +  )  or  right 
(— ),  and  partially  reflected  and  transmitted,  with 
reflection  and  transmission  amplitudes  r*  and  t±. 
For  —  eVL>E^>  —  eVR,  the  nondegenerate  solu¬ 
tions  represent  non-current-carrying  states.  In  the 
following  the  index  a  =  ±  will  label  left-  and 
right-traveling  waves  in  the  energy  range  EM> 
eVL,  and  {z\prr)  =  <£(z). 

To  the  leading  order  in  a,  the  diagonal  elements, 
PuK*  (symbol  used  in  place  of  p^k^ko)  of  the 
density  matrix  obey  the  Eqs.  [7-9]: 


o.  ^  v  [  HjtATcr;  vK'a'PvK'o1  ^EuKlal',fiKfiP^iKa\ 

U  vKW^/iKa 

I  ( dPiiKo  \ 

V  dt  L 

(3) 

here  WllK^’.l:Kn  represents  the  probability  per  unit 
time  of  a  transition  from  the  state  |  pKo)  to  the 
state  \v  K'a')  using  Fermi  golden  rule.  While  the 
derivations  provided  in  [7  -  9]  can  be  followed  very 
easily,  now  one  must  also  assume  that  the 
reservoirs  do  not  alter  the  ratio  a  between  off- 
diagonal  and  diagonal  matrix  elements.  Van  Hove 
[8]  has  shown  the  validity  of  Eq.  (3)  in  describing 
approach  to  equilibrium,  in  the  case  of  closed 
systems  (for  which  the  term  (dp/dt)Tes  is  absent), 
provided  the  initial  state  is  chosen  as  ‘quasi- 
diagonal’:  At  /  =  0  we  must  have  pxlfCovKW  =  0  for 
Et,(K)  —  EJKr)  >  6,  where  S/h  is  of  the  order  of 
the  relaxation  rate,  1/r.  Similarly,  Eq.  (3)  is  valid 
as  long  as  the  contacts  do  not  ‘inject’  off-diagonal 
elements,  that  is  (dp^^/dt)^  =  0  for  E^K) 
—  E„(K!)  >  6.  This  is  the  case  for  very  delocalized 
wavepackets  entering  the  contact,  plane  waves 
being  the  ideal  limiting  case.  In  this  limit,  the  effect 
of  the  contacts  can  be  modeled  phenomenologi¬ 
cally  by  assuming  that  the  left  reservoir  attempts 
to  restore  charge  neutrality  by  injecting  a  flux  of 
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particles  of  any  K  into  the  right-bound  component 
of  the  states  associated  to  waves  incident  from  the 
left,  fixed  by  the  Fermi  level  at  the  cathode,  Ep. 
Accounting  also  for  the  flux  out  of  the  device  via 
either  contact,  one  can  write: 

(4) 

where  =  YIkP^ko,  T*  =  \AJtJ\2/\A^\2,  hk±  = 
[2m*(Ep  +  eVRfL)}lf2,  and  the  A9 s  are  normali¬ 
zation  constants.  Similar  expressions  also  hold  for 
states  traveling  from  the  right.  The  occupation 
of  the  states  at  thermal  equilibrium,  as  determined 
by  the  ‘left’  reservoir,  has  been  expressed  in  terms 
of  the  equilibrium  Fermi  function  (integrated  over 
the  ‘transverse’  wavevectors  K ),  f^L)  (Ep),  pos¬ 
sibly  shifted  in  A;-space,  in  order  to  account  for  the 
open  nature  of  the  contacts  [1,  6],  By  analogy,  a 
similar  expression  (with  the  terms  Pp+  —  T~ pp- 
replaced  by  is  also  used  to  express  the  linear 
relaxation  of  the  of  states  with  energy  - eV R  <  Ep 
<  -eVL.  Finally,  the  net  current  flowing  through 
the  device  is  given  by: 


(5) 


where  obviously  only  the  doubly-degenerate  states 
with  E^  >  — eVL  contribute  to  the  sum. 

Criticisms  have  been  raised  against  Eq.  (3)  based 
on  the  violation  of  continuity  [1].  Yet,  it  should  be 
noted  that  at  steady  state  the  reservoirs  and  the 
reverse  scattering-induced  transitions  provide  the 
‘missing’  current  and  current  continuity  is  trivially 
satisfied.  However,  continuity  is  indeed  violated 
when  using  the  PME  to  describe  ‘fast’  time-dep¬ 
endent  phenomena:  For  example,  any  time  depen¬ 
dent  perturbation  SV  (t)  will  cause  an  electron  in 
| fiKo)  at  t  =  0  to  have  nonzero  amplitude  also  in  a 
state  \vKfcrf).  At  long  times  the  requirement  that 
this  amplitude  be  negligible  translates  into  the 
condition  \ed6V/dt\  C  ti/r^,  expressing  the  fact 


that  the  applied  bias  cannot  change  appreciably 
over  a  relaxation  time,  so  that  the  PME  is  unable 
to  handle  nontrivial  time-dependent  phenomena. 

3.  MONTE  CARLO  SOLUTION 

OF  ONE  DIMENSIONAL  EXAMPLES 

In  order  to  implement  Eq.  (3)  in  one-dimensional 
situations,  starting  from  a  given  potential  V(z), 
such  as  a  Thomas-Fermi  solution,  the  Schrodinger 
equation  is  solved  with  vanishing  Neumann 
boundary  conditions.  The  eigenvalues  {Ep}  will 
be  spaced  densely  enough  (to  accurately  reproduce 
the  bulk  density  of  states  in  the  reservoirs)  if  the 
length  L  of  the  device  is  selected  appropriately. 
The  extended  states  can  be  finally  decomposed 
into  left-  and  right-traveling  waves  following  the 
‘quantum  transmitting  boundary  method’  by  Lent 
and  Kirkner  [10].  The  levels  \p)  are  populated 
following  Potz  [6]  and  the  Poisson  equation  can  be 
solved  again,  using  well-known  potential-damping 
schemes  to  improve  convergence.  Iterating  this 
Poisson/Schrodinger  scheme  yields  a  solution  in 
the  ballistic  case.  Scattering  is  introduced  by 
calculating  all  transition  rates  W^’o'-^Ka  between 
the  levels,  and  adding  the  PME  as  a  third  equation 
over  which  one  must  iterate.  A  Monte  Carlo 
algorithm  is  used  here.  Scattering  and  injection/ 
exit  from  contacts  are  treated  stochastically  in 
order  to  determine  the  new  populations  ppKcr. 
Typically,  300,000  ‘electrons’  are  employed,  using 
a  time  step  6t  «  10-16  s  in  the  Monte  Carlo 
algorithm.  After  about  100-500  Monte  Carlo 
steps,  the  Poisson  and  the  Schrodinger  equations 
are  solved  again  until  convergence  is  obtained. 
Finally,  the  current  flowing  through  the  device  is 
obtained  from  Eq.  (5),  while  information  about 
carrier  density,  kinetic  energy,  and  velocity  as  a 
function  of  position  inside  the  device  can  be 
obtained  from  the  expectation  values  of  the 
corresponding  operators  as  traces  over  their 
products  with  the  density  matrix. 

As  a  first  example,  let’s  consider  a  simple  nin 
resistor  at  300  K  consisting  of  a  300  nm-long 
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region  between  z  =  0  and  z  =  L  with  «-type  regions 
100  nm-long  rc-type  doped  to  1017  cm-3,  and  an 
intrinsic  region  also  100  nm  long.  A  uniform 
effective  mass  of  m*  =  0.32  m0  (where  ra0  is  the 
free  electron  mass)  and  a  dielectric  constant  e  =  1 1 .7 
evac  (where  evac  is  the  permittivity  of  vacuum)  are 
used.  Nonpolar  scattering  with  optical  phonons  is 
included  using  a  photon  energy  60  meV,  an  optical 
deformation  potential  of  5  x  108  eV/cm,  a  crystal 
density  px  —  2.33  g/cm3.  These  parameters  imply  a 
‘dephasing  length’  X#  in  reservoirs  of  the  same 
material  of  the  order  of  785  nm,  larger  than  the 
active  region  of  the  device,  about  100  nm  long. 

Figure  1  shows  the  potential,  carrier  density, 
kinetic  energy,  and  velocity  at  the  end  of  the 
iteration.  The  dashed  lines  are  the  corresponding 


POSITION  (nm) 


FIGURE  1  Potential  and  charge  density  (top  frame),  kinetic 
energy  and  drift-velocity  (bottom  frame)  as  a  function  of 
position  2  inside  an  nin  device  calculated  using  the  master  (solid 
lines)  and  Boltzmann  (dashed  lines)  equation.  The  charge 
accumulation  at  the  device  boundaries  is  an  artifact  caused  by 
boundary  conditions  employed  to  solve  the  Schrodinger 
equation. 


quantities  obtained  from  a  conventional  Monte 
Carlo  solution  of  the  BTE  using  identical  para¬ 
meters.  The  current  density  is  about  6.78  x  104  A/ 
cm2,  about  10%  smaller  than  the  value  obtained 
using  the  BTE,  7.58  x  104  A/cm2.  This  difference 
originates  mainly  from  a  slightly  higher  built-in 
barrier  at  the  cathode/intrinsic-region  junction 
seen  in  the  ‘quantum’  results,  caused  by  the 
penetration  of  wavefunctions  into  the  barrier 
itself.  The  penetration  of  charge  into  classically 
forbidden  region  is  also  the  origin  of  the  redis¬ 
tribution  of  charge  seen  in  the  top  frame  of 
Figure  1:  The  charge  density  obtained  using  the 
PME  is  larger  in  the  intrinsic  region,  as  wavefunc¬ 
tions  penetrate  into  the  gap  beyond  the  classical 
turning  point.  This  charge  redistribution  is  the 
major  factor  responsible  for  the  differences  seen  in 
average  kinetic  energy  and  drift  velocity. 

The  second  example  is  a  simple  GaAs/Al0.3_ 
Ga0.7As  resonant  tunneling  structure.  Two  150  nm 
thick  layers  of  GaAs-like  semiconductor  with  n- 
type  doping  (2  x  1017  cm-3)  are  separated  by  two 
AlGaAs  tunnel  barriers,  each  2.8  nm  thick, 
enclosing  a  4.5  nm  wide  semiconductor  well. 
Two  3  nm-thick  undoped  ‘spacer’  layers,  in  turn, 
enclose  the  double-barrier  region.  ‘Conventional’ 
parameters  for  GaAs  and  Alo.3Gao.7As  are  used 
[12],  also  including  nonpolar  scattering  with 
acoustic  phonons  and  polar  scattering  with  long¬ 
itudinal  optical  phonons.  A  lattice  temperature  of 
300  K  is  used.  The  contact  mobility  used  to 
displace  the  distributions  injected  by  the  reservoirs 
is  taken  to  be  pei  =  5000  cm2/Vs,  corresponding  to 
a  dephasing  length  in  the  reservoirs,  «  130  nm, 
larger  than  the  active  region  of  the  device.  Since 
even  at  room  temperature  scattering  is  not 
sufficiently  strong  to  fully  populate  the  states  in 
the  accumulation  layer,  as  discussed  by  Frensley 
[1],  the  curves  in  Figure  2  show  that  a  large 
fraction  of  the  applied  bias  drops  over  the  cathode 
region  to  the  left  of  the  double-barrier  region. 
Although  similar  results  have  been  obtained  by 
Frensley  [1]  and  Kluksdahl  et  al  [11],  they  appear 
poorly  credible,  since  the  potential  profile  clearly 
depends  on  the  size  of  the  simulated  region. 
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FIGURE  2  Potential  energy  vs.  position  in  a  resonant  tunnel 
diode  calculated  including  electron-phonon  scattering.  The 
curves  are  parametrized  by  the  applied  bias  from  0.0  to  0.6  V 
in  steps  of  0.05  V.  Note  the  ‘gaps’  at  the  left  of  the  barriers 
around  0.2  and  0.45  V.  Convergence  could  not  be  obtained 
close  to  these  biases,  because  of  an  instability  caused  by 
scattering,  as  discussed  in  the  text.  The  dotted  solutions  for 
biases  of  0.2  and  0.45  V  are  only  representative  of  the  average 
potential  during  the  iteration. 

Figure  3  shows  the  current-voltage  character¬ 
istics:  As  the  resonance  is  approached  (at  about 
0.2  V),  the  associated  build-up  of  charge  in  the 
well  screens  the  cathode  field,  increasing  the  field 
across  the  collector  barrier.  A  critical  point  is  soon 
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FIGURE  3  Current  density  in  the  GaAs/AlGaAs  resonant 
tunneling  structure  with  the  inclusion  of  electron-phonon 
scattering.  Note  the  ’snap  back’  of  the  current  at  biases  of  0.2 
and  0.45  V,  corresponding  to  the  instability  regions  of  Figure  2. 
The  error  bars  are  used  to  illustrate  the  range  of  the  oscillation 
of  the  current.  The  inset  shows  the  Calculated  current  density 
vs.  simulation  time  at  two  bias  points,  slightly  away  from 
(0.175  V)  and  close  to  (0.200  V)  resonance.  Note  the  instability 
of  the  current  at  the  bias  point  near  resonance. 


reached  at  which  the  rate  at  which  scattering  feeds 
charge  into  the  states  localized  in  the  well  becomes 
smaller  than  the  rate  at  which  the  well-charge  leaks 
to  the  right.  Charge  is  removed  from  the  well,  and 
the  potential  profile  now  ‘snaps’,  a  larger  voltage 
now  dropping  at  the  left  of  the  barriers.  The 
population  of  the  current-carrying  scattering 
states,  and  so  the  current  itself,  now  drops.  As 
the  applied  bias  is  increased  further,  another 
‘snapping  point’  is  reached,  at  about  0.45  V.  These 
critical  bias  points  can  be  seen  as  ‘gaps’  in  the 
cathode  potential-profile  in  Figure  2. 

As  loosely  indicated  by  the  error  bars  in  Fig¬ 
ure  3,  for  biases  close  to  and  slightly  beyond 
resonance  the  situation  is  unstable:  As  ‘resonant 
wavefunctions’  leak  out  to  the  anode,  scattering  to 
non-current-carrying  states  in  the  collector  region 
is  enhanced,  thanks  to  a  larger  overlap  (form) 
factor.  The  resulting  redistribution  of  charge  alters 
the  potential  profile,  driving  the  system  away  from 
resonance.  As  this  happens,  the  charge  in  the 
anode  side  of  the  barriers  drops  once  more,  and 
the  system  is  now  driven  back  towards  a  resonant 
condition,  and  the  cycle  repeats  once  more.  The 
inset  of  Figure  3  illustrates  some  features  of  this 
instability.  The  good  convergence  properties  of  the 
computed  current  density  for  a  bias  slightly  away 
from  resonance  (0.175  V)  are  compared  to  the 
unstable  behavior  for  a  bias  close  to  the  first 
resonance  (0.200  V).  In  the  latter  case,  with  some 
imagination  one  may  detect  an  oscillatory  behav¬ 
ior  between  two  states  [11,  13]. 

4.  SUMMARY  AND  CONCLUSIONS 

The  main  conclusion  of  this  paper  is  that  there  is 
an  interesting  class  of  problems  for  which  the  Pauli 
master  provides  a  solution  of  the  electron  trans¬ 
port  problem  overcoming  some  of  the  strongest 
limitations  of  the  semiclassical  Boltzmann  equa¬ 
tion:  For  devices  with  active  regions  smaller  than  a 
few  hundreds  of  nm,  the  master  equation  makes  it 
possible  to  account  for  the  wave  nature  of  the 
electrons,  although  only  in  the  weak  scattering 
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limit  and  in  steady  state  or  slow  (adiabatic)  time 
transients.  The  major  price  one  pays  is  the  inability 
to  handle  fast  time-transients  and  strong  scattering 
situations,  collisional  broadening  and  coherent 
transport  in  the  femtosecond  time  scale  being  the 
most  notable  examples.  The  advantages  consist  in 
the  ability  to  treat  inelastic  scattering  processes, 
arbitrarily  strong  electric  fields  (including  intra- 
collisional  field  effects)  tunneling  phenomena, 
elastic  and  not,  and,  in  general,  all  those  effects 
which  depend  on  the  wavelike  nature  of  the 
electrons. 
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Quantum  Transport  in  Open  Nanostructures 
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Electron  transport  was  studied  in  an  open  square  quantum  dot  with  a  dimension  typical 
for  current  experiments.  A  numerical  analysis  of  the  probability  density  distribution 
inside  the  dot  was  performed  which  enabled  us  to  unambiguously  map  the  resonant  states 
which  dominate  the  conductance  of  the  structure.  It  was  shown  that,  despite  of  the 
presence  of  dot  openings,  transport  through  the  dot  is  effectively  mediated  by  just  a  few 
(or  even  a  single)  eigenstates  of  the  corresponding  closed  structure.  In  a  single-mode 
regime  in  the  leads,  the  broadening  of  the  resonant  levels  is  typically  smaller  than  the 
mean  energy  level  spacing,  A.  On  the  contrary,  in  the  many-mode  regime  this  broadening 
typically  exceeds  A  and  has  an  irregular,  essentially  non-Lorentzian,  character.  It  was 
demonstrated  that  in  the  latter  case  eigenlevel  spacing  statistics  of  the  corresponding 
closed  system  are  not  relevant  to  the  averaged  transport  properties  of  the  dot.  This 
conclusion  seems  to  have  a  number  of  experimental  as  well  as  numerical  verifications. 


Keywords:  Quantum  dots,  resonant  states,  conductance  fluctuations 


In  nanoscaled  semiconductor  quantum  dots, 
electron  motion  is  confined  in  all  spatial  dimen¬ 
sions  and  the  lateral  shape  of  the  dot  can  be 
controlled  by  an  applied  gate  voltage  [1-6].  In 
high  quality  samples  at  low  temperatures  electron 
transport  is  ballistic,  i.e.,  large-angle  elastic  scat¬ 
tering  events  occur  only  at  the  boundaries  of  the 
structure  and  the  phase  coherence  length  well 
exceeds  the  dimension  of  the  device.  During  recent 
years  a  great  deal  of  effort  has  been  focused  on  the 
transport  properties  of  ballistic  microstructures.  In 
particular,  both  the  statistical  properties  of  the 
conductance  fluctuations  [1-4,  7-10]  as  well  as 
geometry-specific,  non-averaged  features  of  the 


magnetoresistance  of  quantum  dots  [1,  4,  5,  9,  11] 
have  been  extensively  studied. 

Transport  properties  of  open  microstructures 
are  often  analyzed  on  the  basis  of  the  known 
characteristics  of  the  spectrum  of  the  correspond¬ 
ing  isolated  system.  However,  when  the  dot 
becomes  open,  eigenlevels  interact  and  acquire  a 
finite  broadening  due  to  the  possibility  for 
electrons  to  escape  from  the  dot  via  lead  openings. 
With  several  propagating  modes  in  the  leads,  this 
broadening  might  well  exceed  the  mean  energy 
level  separation,  A,  resulting  in  an  overlap  of  a 
vast  number  of  resonances.  Also,  the  presence  of 
dot  openings  may  cause  a  significant  distortion  of 
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corresponding  eigenstates.  Under  these  circum¬ 
stances  it  is  not  a  priori  evident  whether  a 
discussion  of  transport  through  an  open  dot  on 
the  basis  of  the  properties  of  the  Hamiltonian  of 
the  closed  structure  is  still  meaningful.  To  the  best 
of  our  knowledge,  up  to  now  no  direct  theoretical 
calculations  on  the  actual  broadening  of  the 
resonant  levels  for  the  open  dots  in  the  transmis¬ 
sive  regime  are  available. 

Besides,  in  the  current  literature  there  exists  a 
number  of  conflicting  reports  on  the  effects  of 
leads  on  the  character  of  electron  dynamics  in 
open  systems  (chaotic  vs  regular).  In  particular, 
[12]  shows  that  the  statistics  of  the  spectra  for  open 
dots  are  exactly  the  same  as  those  of  the 
corresponding  closed  system.  At  the  same  time, 
results  [9,  10,  13]  suggest  that  the  leads  attached  to 
the  dot  may  change  the  level  statistics,  so  that 
transition  to  chaos  can  occur  in  a  nominally 
regular  system.  On  the  contrary,  Wang  et  al  [8] 
conclude  that  the  openness  of  the  dot  makes 
chaotic  scattering  non-essential. 

In  this  paper,  on  the  basis  of  our  direct  mapping 
of  resonant  states  performed  for  an  open  square 
dot,  we  hope  to  contribute  to  the  clarification  of 
some  fundamental  issues  in  this  context. 

The  system  under  investigation  is  a  relatively 
large  square  dot  with  the  side  L  =  1  pm  which  is 
typical  for  current  experiments.  It  is  connected  to 
reservoirs  by  quantum  point  contact  (QPC)-like 
openings  (leads),  see  Figures  1-3.  For  the  sake  of 
simplicity,  hard  wall  confinement  and  a  flat 
potential  profile  inside  the  dot  are  assumed  which 
seems  to  be  a  good  approximation  for  large  dots 
[11].  We  disregard  effects  of  the  soft  impurity 
potential  due  to  remote  donors  as  well  as  inelastic 
scattering  events. 

Conductance  through  the  dot  in  perpendicular 
magnetic  field  B  at  finite  temperature  is  calculated 
within  the  Landauer-Biittiker  formalism  [14], 
where  a  transmission  probabilities  and  wave 
functions  were  computed  by  making  use  of  the 
recursive  Green  function  technique  [15].  Analyzing 
the  probability  density  distribution  inside  the  dot 
we  are  in  a  position  to  identify  resonant  energy 


FIGURE  1  Lower  left:  The  conductance  of  the  square  dot 
(schematically  depicted  in  the  upper  panel)  in  the  tunneling 
regime  as  a  function  of  the  sheet  electron  density  EFm*/nti2. 
Temperature  T—  0.  Shadow  regions  in  the  leads  represent 
tunneling  barriers  with  the  height  exceeding  the  Fermi  energy. 
Lower  right:  Eigenenergy  levels  of  the  isolated  dot.  Height  of 
the  peaks  represents  the  degree  of  degeneracy  (1  or  2).  Upper 
panel  shows  the  calculated  probability  density  distribution 
\^mn\2  inside  the  dot  for  one  of  the  tunneling  peaks  (left)  and 
the  corresponding  numerical  results  for  the  coefficients  \cmn\2 
calculated  on  the  basis  of  Eq.  (1)  (right).  Dashed  lines  indicate 
the  circle  with  the  radius  r  =  kpLj'n.  A  similar  analysis  has  been 
done  for  the  rest  of  the  peaks  and  the  correspondence  between 
eigenstates  of  the  isolated  square  and  resonant  levels  of  the  dot 
is  indicated  by  the  arrows.  Quantum  numbers  of  the  resonant 
states,  {m,  n),  are  shown  in  the  parenthesis.  In  the  case  under 
consideration  the  side  of  the  dot  was  chosen  to  be  L  =  0.5  pm. 


states  which  effectively  mediate  transport  through 
the  dot  at  a  given  EF.  To  do  this,  we  numerically 
expand  the  solution  of  the  scattering  problem  in 
the  open  dot,  ^(x,t;  E ),  in  the  set  of  eigenstates  of 
the  closed  dot,  =^sin^sin^  (with  eigen- 
energies  emn  =  ti2/2m*(k2m  +k2n);km  =  ?f,  k„  =  f) 


(*,  y\ E)  =  -  Cmn  (E)sin  L 


7 mix  .  my 
- sin  -j~ . 


(i) 


Coefficients  cmn  represent  the  contributions  of  the 
eigenstates  m ,  n  in  the  total  wave  function. 
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FIGURE  2  The  conductance  of  the  square  dot  (schematically 
depicted  in  the  inset)  as  a  function  of  the  sheet  electron  density 
ns  =  Epm* fn ft2.  The  side  of  the  dot  L  =  1  pm;  temperature 
T=  0.  The  lead  openings  support  one  propagating  mode.  Insets 
show  calculated  inside  the  dot  for  two  representative 

values  of  ns  and  the  corresponding  numerical  results  for  the 
coefficients  |cm„|2  calculated  on  the  basis  of  Eq.  (1).  Dashed 
lines  indicate  the  circle  with  the  radius  r  =  kFL/'K .  (b) 
Dependence  of  the  coefficients  |cm„|2  identifying  the  dominant 
resonant  states  with  the  quantum  numbers  (m,  n)  on  the  sheet 
electron  density  ns.  A  horizontal  bar  indicates  the  mean  energy 
level  spacing  A.  (c)  Dwell  time  of  the  dot. 

In  what  follows  we  focus  on  the  three  different 
transport  regimes,  namely  the  tunneling,  the 
single-mode,  and  the  many-mode  regimes. 

Tunneling  regime  Figure  1(a)  shows  conductance 
of  the  square  dot  in  a  regime  when  the  dot  is 
weakly  coupled  to  the  leads.  Each  conductance 
peak  corresponds  to  an  excitation  of  one  single 
resonant  energy  level  which  effectively  mediates 
transport  at  the  given  Fermi  energy.  Near  its 
maximum,  each  peak  is  characterized  by  the 


FIGURE  3  The  conductance  of  the  square  dot  (see  inset)  in  a 
many-mode  regime  in  the  lead  openings,  N=  5.  The  side  of  the 
dot  L=  1  pm;  temperature  T=  0.  (b)  Coefficients  |cOT„|2 
identifying  the  dominant  resonant  states  with  the  quantum 
numbers  (m,  n).  A  horizontal  bar  indicates  the  mean  energy 
level  spacing  A.  (c),  (d)  The  same  as  (b)  but  in  a  refined  scale. 
(The  contribution  from  the  three  dominant  states  indicated  in 
(b)  is  not  shown),  (e)  Dwell  time  of  the  dot. 

Lorentzian  shape,  in  accordance  to  the  Breit- 
Wigner  formalism.  The  positions  of  the  peaks  are 
shifted  with  respect  to  the  corresponding  eigen- 
energy  levels  of  the  isolated  square  (cf.  Figs.  1(a) 
and  (b)). 

Single-mode  regime  Figure  2(a)  shows  the  con¬ 
ductance  of  the  square  dot  where  lead  openings  are 
adjusted  to  support  one  propagating  mode.  Here 
the  pattern  of  the  probability  density  distribution 
|^mn|2  exhibits  a  complicated  structure,  where 
eigenstates  of  the  isolated  square  are  not  easily 
recognized.  Thus,  a  numerical  analysis  on  the  basis 
of  Eq.  (1),  in  contrast  to  the  tunneling  case,  is 
essential.  Calculating  the  expansion  coefficients 
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cmn,  we  find  that,  at  the  given  kF,  only  those 
coefficients  associated  with  the  circle  in  the  fc-space 
with  the  radius  R  =  kF&  y/2rn*emn/h  =n Vm2  +  n2 
/ L  are  distinct  from  zero,  see  Figure  2(a).  (Note 
that  non-vanishing  contributions  from  other  coef¬ 
ficients  would  indicate  that  eigenstates  of  the 
isolated  dot  are  essentially  distorted  by  the  lead 
openings  such  that  a  discussion  of  the  transport  in 
open  structure  in  terms  of  eigenstates  of  the 
isolated  dot  does  not  make  sense).  Typically,  we 
find  that  only  a  few  (and  sometimes  even  a  single) 
coefficients  give  a  dominant  contribution.  A 
broadening  of  the  resonant  levels  due  to  the  effect 
of  the  dot  openings  in  the  A>space  is  less  than  the 
distance  between  neighboring  eigenstates  whose 
quantum  numbers  differ  by  one,  A k  =  \kn  -  k„± i| 
=  |  km  -  km±  i|  =  7 r/L.  Therefore,  we  conclude  that 
despite  the  presence  of  dot  openings ,  transport 
through  the  structure  is  still  effectively  mediated 
by  a  few  eigenstates  of  the  corresponding  closed  dot 
with  the  eigenenergies  lying  in  close  proximity  to  the 
Fermi  energy ,  emn  =  EF. 

Calculating  the  coefficients  cmn  as  a  function  of 
the  Fermi  energy,  we  extract  information  about 
the  lead-induced  broadening  of  the  energy  levels  of 
the  dot.  A  contribution  of  the  dominant  resonant 
energy  states  is  shown  in  Figure  2(b).  The  mean 
energy  level  spacing,  A  =  2Tr7z2/ra*L2  ~  70  mK,  is 
indicated  by  a  horizontal  bar.  In  contrast  to  the 
tunneling  regime,  the  lineshape  of  \cmin(E)\2  can  be 
non-Lorentzian.  Moreover,  different  states  are 
characterized  by  different  broadenings  and  they 
may  overlap  with  each  other.  However,  a  broad¬ 
ening  (half-width)  of  the  resonant  energy  levels  is 
typically  less  than  A.  Therefore,  transport  mea¬ 
surements  at  very  low  temperatures  in  a  single-mode 
regime  in  the  leads  may  probe  a  single  resonant 
energy  level  of  the  dot . 

A  comparison  between  Figures  2(a)  and  (b) 
shows  that  features  in  the  conductance  of  the  dot 
are  related  to  excitations  of  the  particular  eigen¬ 
states  of  the  square.  However,  this  correspondence 
is  rather  complicated:  different  eigenstates  can  be 
responsible  for  opposite  features  in  the  dot 
conductance  (dips  and  peaks). 


Many-mode  regime  As  the  lead  openings  become 
wider,  a  number  of  the  resonant  states  excited  in 
the  dot  increases.  Nevertheless,  like  in  the  single¬ 
mode  regime,  at  the  given  Fermi  energy,  a  non 
vanishing  contribution  comes  only  from  the  coeffi¬ 
cients  which  lie  in  the  closest  proximity  to  the  circle 
with  the  radius  kF  in  the  k-space.  Therefore,  even  in 
a  many-mode  regime,  transport  through  an  open 
structure  is  still  effectively  mediated  by  eigenstates 
of  the  corresponding  closed  dot.  A  broadening  of 
the  resonant  energy  levels  increases  with  an 
increase  of  the  lead  openings.  In  Figure  3  wave 
function  patterns  are  analyzed  on  the  basis  of  Eq. 
(1)  and  contributions  from  dominant  states  are 
shown  in  a  representative  interval  of  the  Fermi 
energy.  Typically,  several  states  dominate  trans¬ 
port  at  a  given  EF.  A  broadening  of  the  energy 
levels  has,  as  a  rule,  a  complicated  essentially  non- 
Lorentzian  character  with  half-width  being  differ¬ 
ent  for  different  states.  In  contrast  to  the  single 
mode  regime,  a  half-width  of  the  resonant  energy 
levels  is  typically  larger  than  the  mean  energy  level 
spacing,  A.  Comparing  the  conductance  of  the 
dot,  Figure  3;  and  the  dependence  cmn  =  cmn(EF ), 
one  can  trace  a  certain  correspondence  between 
the  two.  However,  because  many  eigenstates 
typically  contribute  to  the  conductance  at  a  given 
EFi  a  detailed  explanation  of  the  features  of  the 
dot  conductance  is  not  possible. 

In  what  follows  we  critically  examine  several 
approaches  [9,  12]  to  the  analysis  of  the  statistics 
of  the  spectra  of  open  dots.  A  statistical  analysis  of 
the  distribution  of  the  energies  at  resonances  of 
conduction  fluctuations  in  chaotic  stadium  and 
regular  circular  billiards  has  been  performed  by 
Is.hio  [9].  In  both  billiards  the  statistics  follows 
Wigner-type  distribution  which  was  taken  as  an 
indication  of  the  transition  to  chaos.  (The  Wigner- 
type  statistics  of  the  nearest  energy  level  spacing 
distribution  is  characteristic  of  the  classically 
chaotic  closed  billiards).  With  regard  to  this 
analysis,  the  question  immediately  arises  “does 
the  resonance  energy  statistics  of  the  conductance 
fluctuations  reproduce  the  corresponding  statistics 
of  the  isolated  system?”  We  have  shown  above 
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that  even  in  a  single  mode  regime,  resonant 
energies  in  the  conductance  fluctuations  only 
occasionally  correspond  to  resonant  eigenstates 
of  the  isolated  dot.  Instead,  in  most  cases  resonant 
energies  are  related  to  those  energies  when  more 
than  one  state  is  simultaneously  excited  in  the  dot 
such  that  their  mutual  interference  leads  to  the 
resonance  behavior  of  the  transmission  coefficient. 
Therefore,  in  our  opinion,  Wigner-type  statistics  of 
the  spacing  of  the  conductance  fluctuation  reson¬ 
ances  cannot  be  taken  as  an  indication  of  the 
transition  to  chaos  in  a  nominally  regular  but  open 
system. 

Wang  et  al.  [12]  analyzed  statistics  of  the  open 
system  on  the  basis  of  the  calculated  electron  dwell 
time,  f  ds\S5fmn\2,  which  identifies  the  time  an 

electron  spends  inside  the  dot;  in  the  above 
definition  an  integration  is  performed  within  the 
dot  area.  Statistics  of  the  spectra  were  found  to  be 
exactly  the  same  as  that  of  the  corresponding 
closed  system.  Again,  a  similar  question  arises 
“does  the  dwell  time  maxima  (which  is  integrated 
characteristic)  unambiguously  identify  individual 
resonant  states  of  the  open  dot?”. 

Figures  2(c)  and  3(c)  show  the  calculated  dwell 
time  in  the  square  dot  in  the  single-  and  many¬ 
mode  regimes  respectively.  In  a  single-mode 
regime  all  maxima  in  the  dwell  time  do  correspond 
to  the  resonant  eigenstates,  (cf.  Figs.  2(b)  and  (c)). 
However,  dwell  time  does  not  identify  resonant 
states  unambiguously  because  some  eigenstates  are 
overlooked  by  this  analysis.  Nevertheless,  since  the 
number  of  missing  states  is  usually  small  our 
analysis,  as  far  as  a  single  mode  regime  is 
concerned,  tends  to  support  the  conclusion  [12] 
that  the  statistics  of  the  dwell  time  spectra  for  open 
dots  are  the  same  as  that  of  the  corresponding 
closed  systems. 

In  a  many-mode  regime,  our  analysis  strongly 
suggest  that  the  resonant  level  spacing  statistics 
become  ill-defined.  This  is  not  only  because  the 
broadening  of  resonant  states  typically  exceeds 
mean  level  spacing  (for  Lorentzian  broadening  one 
can  still  define  statistics  of  the  spacings  between 


peak  maxima).  This  is  due  to  the  fact  that  the 
broadening  itself  in  many  cases  is  essentially  non- 
Lorentzian,  see  Figure  3(c).  For  most  of  the 
resonant  states  the  concept  of  statistics  of  the 
spectra  does  not  make  any  sense,  since  it  is  not 
possible  to  introduce  any  reasonable  definition  of 
the  spacing  between  resonances.  Therefore,  we 
conclude  that  for  the  quantum  dots  strongly  coupled 
to  the  leads  with  several  modes  available  in  the  lead 
openings,  eigenlevel  spacing  statistics  of  the  corre¬ 
sponding  closed  system  are  not  relevant  to  the 
averaged  transport  properties  of  the  structure. 

This  conclusion  seems  to  have  a  number  of 
experimental  as  well  as  numerical  verifications. 
The  difference  between  statistical  properties  of  the 
conductance  oscillations  in  a  chaotic  (stadium) 
and  a  regular  (circular)  dots  has  been  studied  by 
Marcus  et  al.  [1].  Corresponding  averaged  auto¬ 
correlation  functions  are  almost  identical  over  the 
two  decades  of  decay,  although  they  exhibit  quan¬ 
titative  distinctions  in  the  tail.  The  data,  from  the 
similar  studies  of  Berry  et  al.  [2]  for  chaotic  (circu¬ 
lar  with  a  bar)  and  regular  (circular)  dots,  does  not 
show  any  significant  discrepancy  over  the  four 
orders  of  magnitude  in  power.  Numerical  studies 
of  autocorrelations  functions  for  a  chaotic  (sta¬ 
dium)  and  a  regular  (circular)  dots  [8]  in  a  many¬ 
mode  regime  show  the  similar  behavior,  which  is  in 
accordance  with  our  arguments. 

To  conclude,  despite  of  the  presence  of  dot 
openings,  transport  through  the  open  dot  is 
effectively  mediated  by  just  a  few  eigenstates  of 
the  corresponding  closed  structure.  In  a  single¬ 
mode  regime  in  the  leads  the  broadening  of  the 
resonant  levels  is  typically  smaller  than  the  mean 
energy  level  spacing,  A.  On  the  contrary,  in  the 
many-mode  regime  the  broadening  exceeds  A  and 
has  essentially  a  non-Lorentzian  character. 
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Application  of  the  Wigner-Function  Formulation  to 
Mesoscopic  Systems  in  Presence  of  Electron-Phonon 

Interaction 
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A  theoretical  and  computational  analysis  of  the  quantum  dynamics  of  charge  carriers  in 
presence  of  electron-phonon  interaction  based  on  the  Wigner  function  is  here  applied  to 
the  study  of  transport  in  mesoscopic  systems.  Numerical  applications  are  shown  for  a)  a 
wave  packet  scattering  with  phonons  while  crossing  a  potential  profile  and  b)  electrons 
scattering  with  phonons  in  a  finite  device  with  open  boundary  conditions. 


Keywords:  Quantum  transport,  Wigner  function,  mesoscopic  systems,  boundary  conditions 


1.  INTRODUCTION 

A  rigorous  quantum  transport  theory  of  electrons 
in  semiconductor  structures  including  both  coher¬ 
ent  propagation  and  scattering  mechanisms  is  still 
lacking  even  though  its  need  has  been  recognized 
since  many  years.  In  particular  we  miss  a  theory 
feasible  for  numerical  calculations  to  be  compared 
with  experimental  results  and/or  with  results  of  the 
semiclassical  theory  with  the  purpose  of  enlighten¬ 
ing  the  different  predictions  of  the  two  models. 

In  the  literature  different  approches  have  been 
used  [1-3].  The  Wigner  function  (WF)  [1] 
provides  a  rigorous  quantum-mechanical  tool 
and  constitues  a  direct  link  between  quantum 


and  classical  descriptions  of  the  evolution  of  the 
system  in  phase  space.  In  this  approach  however 
electron-phonon  interaction  has  been  treated  using 
either  relaxation  time  [4,  5]  or  a  classical  Boltz¬ 
mann  collision  operator  [6].  In  this  paper  several 
advancements  in  the  effort  to  solve  the  above 
problem  exactly  are  presented. 

The  definition  of  the  WF  can  be  extended  to 
include  electrons  interacting  with  phonons  as 
follows  [7]:  r 

fw(r,p,nq,n’q,t)  =  ^J  dr'e ~‘V 

p{r  +  r' /2,nq,r  -  r* /2,n'q) 
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where  r  and  p  are  the  electron  position  and 
momentum,  p  is  the  density  matrix  for  the  system 
formed  by  an  electron  and  the  phonon  gas,  and  nq 
is  the  occupation  number  of  the  phonon  mode  q. 
Trace  over  the  phonon  variables  will  lead  to  the 
traditional  electron  WF. 

In  this  paper  we  discuss  an  integral  equation  for 
the  above  WF  and  apply  it  to  study  quantum 
electron  transport  in  open  systems  in  presence  of 
scattering.  In  particular  suitable  boundary  condi¬ 
tions  can  be  used  which  restrict  the  analysis  of  the 
WF  inside  a  finite  domain.  Wigner  trajectories 
have  been  identified  also  for  the  case  of  electron- 
phonon  interaction  and  constitute  a  guiding 
criterion  for  replacing  initial  conditions  with 
boundary  conditions  for  the  integral  equation. 


2.  INTEGRAL  EQUATION 

Let  us  consider  a  system  described  by  the 
Hamiltonian 

H=H0  +  H'  (2) 

where  H0  is  the  “unperturbed”  Hamiltonian, 
containing  the  description  of  the  free  electrons, 
free  phonons  and,  in  the  present  approach,  also 
the  potential  profile  inside  the  system  (device).  Hr 
contains  the  scattering  agents,  i.e.,  the  electron- 
phonon  interaction. 

The  time  evolution  of  the  density  matrix  is 
described  by  the  Liouville-von  Neumann  equation. 
Starting  from  this  equation  we  derived  the  follow¬ 
ing  integral  equation  for  the  WF: 

i. 

fw(r,p,ng,rfq,  t)  =  h3 

n,nf 

**))(/-*)  J  dr'  J  dp Tnn, (r ',/>') 

xfw{r',p',nq,n'q,  to)  +  h3  Y^fnn’{r,p) 

[  (3) 

j  to 


x  /  dr'  /  dp* {H'(nnq,mmq) 

mmq  J  J 

fmn'(r'iP')fw{*,,p',mq,riq,t') 

-f*m,(r',p')fw(r',p',nq,mq,t,)H'(mmq,ririq)} 

In  the  above  equation  H!  =  Hf /ih,  and  1i!{nnq, 
mmq)  are  its  matrix  elements  on  the  basis  {\</>b  nq)} 
of  /70.  The  coefficients  flm  (r,  p)  are  given  by 

M',P)=#  J  dr'e-W*V^r  +  j\4>^(^\r-^ 

(4) 

They  allow  to  move  from  the  WF  to  the  density- 
matrix  representation  and  viceversa.  Finally 
h u>{n,nq)  indicates  the  total  unperturbed  energy 
of  the  electron-phonon  system  when  the  electron  is 
in  the  n- th  eigenstate  and  the  phonon  bath  is  in  a 
state  with  n  phonons  in  mode  q . 

In  Eq.  (3)  the  first  term  in  the  r.h.s.  describes  the 
ballistic  coherent  propagation  from  the  initial  time 
t0  to  the  observation  time  t.  The  second  term 
describes  the  contribution  of  the  unknown  WF  to 
the  “last”  interaction  vertex  at  any  time  tf  between 
to  and  t  followed  by  the  free  propagation  from  t’ 
to  t .  Even  though  the  equation  is  written  for  the 
entire  system  and  contains  also  the  phonon 
variables,  it  has  been  shown  that  a  Monte  Carlo 
solution  allows  for  a  proper  average  over  the 
phonon  coordinates,  as  long  as  hot-phonon  effects 
are  neglected  [8]. 

3.  WIGNER  TRAJECTORIES 

It  is  well  known  that  a  point-like  electronic  WF  in 
phase  space  cannot  represent  a  real  physical 
system,  since  it  violates  the  uncertainty  principle. 
We  may  however  consider  the  evolution  in  time  of 
a  8 -like  contribution  to  the  WF  inside  Eq.  (3). 
For  free  electrons  its  trajectory  in  phase  space  is 
the  same  as  for  semiclassical  particles.  In  fact,  for  a 
plane-wave  basis  the  coefficients  fim  are  given  by 

fkk'{r,P)  =  ^  [P~\{k  +  k'])  (5) 
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and  the  ballistic  evolution  of  the  WF  as  given  by 
the  first  term  on  the  rhs  of  Eq.  (3)  yields: 

fw{r,p,  nq ,  n'q,t)  =fw (r  -  ^  (/  -  to ),/>, «<?,  n'q,  to) 

e-i(u(nq)-u{riq)){t-to)  (6) 


where  m  is  the  electron  effective  mass.  The  first 
factor  and  the  exponential  factor  describe  the  free 
trajectory  of  the  electron  and  the  time  evolution  of 
the  free-phonon  bath,  respectively.  This  result  is 
not  surprising  since  in  absence  of  scattering  and  up 
to  quadratic  potentials  (harmonic  oscillator)  a 
differential  equation  can  be  written  for  the  WF 
that  coincides  with  the  Boltzmann  equation,  so 
that  the  dynamical  evolution  of  the  WF  follows 
the  same  trajectories  of  the  representative  points  of 
a  classical  gas.  This  implies  a  deformation  of  the 
WF  while  evolving  in  time  due  to  the  fact  that,  as 
time  increases,  the  higher-momentum  components 
move  faster  than  the  lower-momentum  compon¬ 
ents. 

If  phonon  scattering  is  included,  similar  trajec¬ 
tories  exist,  with  very  interesting  properties,  as 
long  as  a  single  interaction  diagram  is  considered 
without  time  integrations.  In  order  to  prove  this 
result  Eq.  (3)  must  be  substituted  into  itself  to 
obtain  the  second-order  term  corresponding  to 
one  scattering  event.  Then  assuming  a  constant 
potential  and  using  again  plane  waves  as  basis  set, 
the  second-order  correction,  e.g.,  for  the  terms 
corresponding  to  phonon  emission,  results  to  be 


&fw{z,Pz,nq,nq,t)  =  2K 

f  dt"e-iq 
Jto 


' to 

y-fv 


(z-P-«- 

\  m 


i\  Pz  +  hq/ 2,., 


o 


m 


{t'-t") 


- ('"  -  to) ,Pz  +  -hq,ng-l,nq-l,M  (7) 


where  for  simplicity  a  one-dimensional  case  is 
considered  and  F(q )  is  the  coupling  function 
entering  the  electron-phonon  hamiltonian  [9], 


FIGURE  1  Example  of  the  Wigner  trajectories  that  con¬ 
tribute  to  the  result  in  Eq.  (3)  for  real  and  virtual  emissions 
of  a  mode  q. 


The  trajectories  described  by  the  above  equation 
are  represented  in  the  diagrams  contained  in 
Figure  1.  They  correspond  to  the  semiclassical 
trajectories  where  q!2  is  transferred  to  the  electron 
at  each  vertex.  As  a  result  of  the  time  integrations, 
however,  a  sum  over  infinite  trajectories  gives  the 
WF  correction  corresponding  to  a  single  scattering 
event. 


4.  INITIAL  AND  BOUNDARY  CONDITIONS 

Guided  by  the  considerations  of  the  previous 
section  we  face  the  problem  of  finding  an  equation 
equivalent  to  Eq.  (3)  for  the  WF  in  a  finite  region 
inside  given  boundaries.  The  problem  is  not  trivial 
owing  to  the  nonlocality  of  the  interaction  and  to 
the  integration  from  — oo  to  +  oo  in  the  integral  over 
z'.  We  have  proved  that  for  the  solution  of  the 
integral  equation  it  is  possible  to  substitute  the 
knowledge  of  the  WF  over  all  space  at  a  given 
(initial)  time  t0  with  the  knowledge  of  the  same 
function  inside  the  region  of  interest  at  t0  and  at  all 
times  at  the  boundary  for  “entering”  momenta.  The 
integral  equation  for  such  a  situation  results  to  be 

fw{z,Pz,nq,n'q,t )  = 

h  Y,fnn' 
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J dp'zf*n„<{z' ,p'z)fw{z' ,p'z,nq,n'q,  to) 

+  h  Y,fnn'(z,Pz)  f 

tin* 

{[  jr+^-^a 

xfw(-A,p'z,nq,n'q,t')-  [  dp'2^f*m,{A,p'z) 

J  —  oo  "7 

fw(A,p'z,nq,n'q,t') 

+  f  dz'  [  dP'zJ2  [n’(m^mmg)fL'(z',P'z) 

d -A  J 

fw(^,p'z,mq,nq,  t')  —f*nm(z',p'z) 

fw(z',p'z, nq,  mq,  t')H'(mmq ,  «'«;)]}•  (8) 

where  again  for  simplicity  a  one-dimensional  case 
has  been  considered,  and  -A  and  A  are  the  device 
boundaries. 


5.  APPLICATIONS 

As  a  case  study  we  have  first  analysed  a  single 
electron-phonon  scattering  event  for  a  wave 
packet  crossing  a  potential  profile  [10].  In  this 
case  the  numerical  procedure  is  simplified  by  the 
fact  that  the  whole  system  can  be  enclosed  in  a 
large,  but  finite  box.  Thus  the  basis  set  of  functions 
is  discrete  and  can  be  used  without  finite-difference 
approximations  in  A:-space.  Three  cases  have  been 
considered  for  a  single  electron-phonon  scattering 
event: 

a)  The  effect  of  a  quantum  collision  over  the  free 
propagation, 

b)  Intra-collisional  field  effect  when  the  phonon 
interaction  takes  place  during  a  propagation 
through  a  region  with  a  constant  electric  field, 
and 

c)  Scattering  during  the  otherwise  resonant  pro¬ 
pagation  through  a  double  barrier. 


As  an  example  Figure  2  shows  the  scattering 
and  scattered  WF  at  t  =  1.5ps  after  the  initial 
condition  for  an  electron  crossing  a  double  barrier 
in  resonant  conditions  and  emitting  an  optical 
phonon.  The  interaction  is  switched  on  when  the 
packet  hits  the  double  barrier,  namely  at  t  =  0.5  ps 
after  the  initial  condition.  A  complex  quantum- 


FIGURE  2  Unperturbed  Wigner  function  (above)  and 
second-order  correction  due  to  a  quantum  phonon  emission 
(below)  for  an  electron  crossing  a  double-barrier  potential 
profile  in  resonant  conditions  at  different  times.  The  interaction 
is  active  from  /  =  0.5  ps  after  the  initial  condition. 
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dynamical  interference  between  potential  profile 
and  scattering  is  observed  at  successive  times  [10]. 

In  order  to  verify  the  correctness  of  the  use  of 
the  boundary  conditions  for  an  open  mesoscopic 
system  as  described  in  Section  4  we  have  first 
evaluated  the  ballistic  evolution  of  the  first  term  in 
Eq.  (8)  assuming  a  step  potential  profile, /„  =  0  at 
t  =  to  inside  the  device,  and  a  Maxwellian  local- 
equilibrium  incoming  distribution  at  the  left  and 
right  boundaries.  Results  are  shown  in  Figure  3  at 
different  times  after  the  initial  condition. 

At  the  longest  times  (not  shown  in  the  figure  for 
space  reasons)  we  recover  for  fw  the  same  result 


which  is  obtained  without  using  boundary  condi¬ 
tions  for  a  stationary  infinite  system,  where  fw  is 
evaluated  by  means  of  an  equilibrium  density 
matrix  diagonal  over  the  scattering  states. 

In  order  to  describe  quantum  transport  across  a 
very  small  device  in  presence  of  scattering  the 
integral  in  Eq.  (8)  is  expanded  perturbatively 
through  iterative  substitutions.  The  second-order 
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FIGURE  4  In-emission  term  summed  over  all  phonon  modes 
q  at  t  -  50  fs  after  the  electron-phonon  interaction  has  been 
switched  on. 


FIGURE  3  Ballistic  evolution  of  the  first  term  in  Eq.  (8)  at 
two  times  (t  =  10  fs  and  80  fs)  for  electrons  entering  from  the 
boundaries  into  a  region  with  a  potential  step. 


position  (10  m) 

FIGURE  5  Contribution  to  the  Wigner  function  due  to  real 
emission  of  a  mode  q  =  1.66  •  108  m  -1  at  200  fs  after  the 
electron-phonon  interaction  has  been  switched  on.  Only 
electrons  entering  with  positive  p  are  able  to  emit.  At  long 
times  classical  conservation  is  recovered. 
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contribution  is  the  sum  of  four  terms:  real  and 
virtual  absorptions  and  emissions.  As  an  example 
Figure  4  shows  the  result  for  the  in-emission  term 
summed  over  all  phonon  modes  q  which  compares 
very  well  with  the  classical  result. 

An  interesting  difference  instead  emerges  when 
only  a  single  mode  q  is  considered  in  the  quantum 
calculation,  as  shown  in  Figure  5.  At  short  times 
right  after  the  entrance  energy  non-conservation 
modifies  the  in-scattering  term. 
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Quantum  Networks:  Dynamics  of  Open  Nanostructures 
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The  superposition  principle  makes  quantum  networks  behave  very  differently  from  their 
classical  counterparts:  We  discuss  how  local  and  non-local  coherence  are  generated  and 
how  these  may  affect  the  function  of  composite  systems.  Numerical  examples  concern 
quantum  trajectories,  quantum  noise  and  quantum  parallelism. 


Keywords:  Coherence,  entanglement,  quantum  trajectories,  quantum  parallelism 


1.  GENERAL  BACKGROUND 

Beyond  some  15  years  from  now  any  realistic 
roadmap  for  nanoelectronics  [1]  tends  to  become 
rather  vague:  The  present  trend  towards  miniaturi¬ 
zation  is  expected  to  continue  till  then,  but  as 
physics  is  not  scale-invariant,  fundamental  limita¬ 
tions  will  be  piling  up,  all  of  which  are  eventually 
related  to  quantum  features  of  some  sort:  there  are 
limits  of  manufacturing,  limits  of  control,  limits  of 
simulation  capabilities,  limits  of  classical  system 
theory,  and  limits  of  classical  models  of  computa¬ 
tion,  to  name  but  a  few. 

We  will  not  be  able  to  discuss  these  issues  here  at 
any  depth;  instead,  we  intend  to  address  some 
pertinent  questions  based  on  “toy-models”  rather 
than  sophisticated  device  simulations.  Neverthe¬ 
less,  this  approach  should  be  able  to  demonstrate 
in  what  sense  quantum  networks  will  differ  from 
classical  ones. 


1.1.  How  Can  We  Design  Quantum  Systems? 

Stable  structure,  though  eventually  a  quantum 
phenomenon  in  its  own  right,  enters  nanophysics 
as  a  “classical”  design  tool:  Confinement  of 
electron-(or  photon-)  fields  allows  to  discretize 
the  state  space,  at  least  in  some  energy  range  of 
interest.  This  “quantum-size-effect”  considerably 
simplifies  the  specification  and  observation  of  the 
non-classical  features  to  be  discussed  below,  as  it 
significantly  reduces  the  number  of  “subsystems” 
to  be  considered. 

1.2.  What  is  a  Subsystem? 

It  turns  out  that  the  only  viable  definition  of  a 
subsystem  is  operational:  A  subsystem  has  to  be 
separated  out  by  the  classical  environment  (experi¬ 
mental  set-up)  in  terms  of  measurement  modes.  A 
quantum  dot  or  an  atom  (within  an  array)  will  be  a 
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subsystem  if  and  only  if  it  is  selectively  “con¬ 
tacted”.  Note  that  if  this  condition  is  not  fulfilled, 
the  array  has  to  be  treated  as  a  unit;  though  this 
will  not  change  the  total  state  space,  it  will 
certainly  change  the  type  of  possible  observations. 

1.3.  How  do  we  Describe  Subsystems? 

We  restrict  ourselves  to  subsystems  //  characteri¬ 
zed  by  n  different  states  | p^),  each.  The  number  of 
independent  observables  (represented  by  a  set  of 
orthonorm alized  operators  Xj(p))  is  then  restricted 
to  «2,  i.e.,  y  =  0  (denoting  the  unit-operator:  “no 
action”),  1,. .  .,n2- 1.  Any  operator  can  be  written 
as  some  linear  combination  of  these.  We  may 
choose  the  Ay  such  that  the  expectation-values  (Ay) 
form  a  vector  [2],  [3],  for  n  =  2  the  so-called  Bloch- 
vector. 

1.4.  How  do  we  Describe  Networks? 

Any  operator  (for  a  network  composed  of  N 
subsystems,  say)  can  be  written  as  a  linear 
combination  of  the  n2N  “cluster-operators” 

Qjk..,  =  A/(l)Afc(2) . . .  Xr(N)  (1) 

These  Q,  in  general,  do  not  commute;  they  come 
in  different  classes,  specified  by  the  number  m<N 
of  subsystems  which  are  actually  affected  (which 
equals  the  number  of  indices  unequal  zero):  Q00...0 
is  the  unit-operator,  Qj0k_ 0  an  operator  acting  on 
fi=  1,3  (compact  notation  Ay(l)A^(3)),  etc.  It  is 
important  to  realize  that  our  definition  of  sub¬ 
systems  guarantees  that  expectation  values  ( Q )  of 
any  such  ra-cluster-operator  can  be  obtained  from 
a  series  of  respective  coincidence  experiments 
(ensemble-  or  time-averaged). 

1.5.  Can  we  Reconstruct  any  State? 

The  density  matrix  is  uniquely  specified  once  the 
expectation  values  of  this  full  hierarchy  of  cluster- 
operators  are  given  [3].  As  the  number  of 
independent  state  parameters  grows  exponentially 


with  A,  complete  state  reconstruction  [4]  tends  to 
be  severely  limited.  So,  N  needs  to  be  small,  as 
incomplete  measurements  are  likely  to  miss  non- 
classical  features  all  together.  (This  is  why  our 
world  appears  so  classical). 

1.6.  What  is  a  “Classical”  State? 

Within  an  entirely  quantum  mechanical  treatment 
the  notion  of  a  “classical”  state  is  somewhat 
ambiguous.  For  a  definition  one  usually  refers  to  a 
complete  local  basis  as  the  eigenstates  of  a  local 
operator  (or  set  of  those):  For  the  two-level  system 
studied  here  we  use  the  local  basis  operators, 
A3 (fi).  Products  of  their  eigenstates,  \p\p2, •  •  -,Pn), 
Pi  =  1,2 then  define  a  complete  basis  set  for 
the  whole  network,  taken  as  our  “classical” 
reference. 

1.7.  What  are  “Non-classical”  States? 

Superpositions  of  those  classical  states  are  now 
non-classical  by  definition:  There  are  superposi¬ 
tions  within  a  single  subsystem  only  (local  coher¬ 
ence,  described  here  by  the  expectation  values  of 
Ai(aO?A2 (m))  and  superpositions  involving  more 
than  one  subsystem  (entanglement,  described  by 
appropriate  expectation  values  of  m  >  2  -  cluster 
-  operators).  The  fact  that  the  subsystems  are 
separable  does  not  imply  that  the  state  of  the 
network  is  separable  into  these  constituents:  The 
implies  a  kind  of  “quantum  fuzziness”  of  the 
subsystems:  Entangled  states  are  states  which, 
allowing  for  any  local  [5]  unitary  transformation, 
cannot  be  written  as  a  simple  product  state.  A 
useful  entanglement  measure  should  thus  be 
invariant  under  such  local  transformations.  (There 
is  a  generalization  for  mixed  states). 

1.8.  How  can  we  Quantify  Entanglement? 

Entanglement  has  been  introduced  as  a  property 
of  a  wavefunction  not  to  factor  into  a  product. 
This  can  be  tested  by  means  of  appropriate 
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expectation  values.  The  matrix-elements  [3] 

Mf  ■  =  <(a,-(m)  -  <a,(m)»(a»  -  &(")»  •  •  •) 

(2) 

are  of  the  form  of  covariances  known  from 
classical  statistical  physics;  non-zero  terms  indicate 
deviations  from  the  product  state  form  [6], 
Convenient  measures,  which  are  much  less  detailed 
but  turn  out  to  be  invariant  under  any  local 
unitary  transformations,  are  (for  n  =  2) 

r  =  piv°  =  \Y.m7m7 

3  ij  ilk 

(3) 

These  measures  have  upper  bounds;  here  they  are 
normalized  to  1 .  For  a  network  with  N  >  2 
subsystems  there  is  not  just  one  but  a  whole 
hierarchy  of  such  entanglement  measures.  In 
addition  to  these  we  also  introduce  a  measure  for 
local  coherence;  with  the  eigenstates  of  A3  as  the 
local  reference  (n  =  2),  a  possible  choice  is 

=  (Xi(fi))2  +  (A2(/x))2  (4) 

Of  course,  cannot  be  invariant  under  local 
unitary  transformations. 

1.9.  What  is  the  Origin  of  Entanglement? 

Perhaps  the  best  known  source  of  entanglement  is 
the  permutation  symmetry  requirement  for  indis¬ 
tinguishable  particles.  However,  such  Fermi  (Bose-) 
states  are  not  characterized  by  specific  entangle¬ 
ment  types.  Eigenstates  of  global  operators  like 
total  angular  momentum,  may  also  exhibit  en¬ 
tanglement.  A  dynamical  source  of  entanglement, 
finally,  can  be  traced  back  to  any  two-subsystem 
interaction  (like  Coulomb  interaction):  The  corre¬ 
sponding  Hamiltonian  generates  a  unitary  time 
evolution  which  is  non-local  and  will  thus,  in 
general,  change  the  entanglement. 


1.10.  What  are  Observable  Effects 
of  Entanglement? 

While  coherence  physics  can  do  with  appropriate 
local  measurements  (m=  1),  entanglement  shows 
up  not  before  m  >  2-cluster-operators  are  in¬ 
volved:  Typically  this  applies  to  the  total  Hamil¬ 
tonian  itself;  well-known  effects  are  energy  shifts 
and  (exchange-)  splittings  for  Fermi-systems. 

More  specifically,  entanglement-physics  relates 
to  the  study  of  a  single  network  through  a  number 
of  selective  modes  or  contacts.  We  will  discuss 
three  qualitatively  different  examples  based  on  the 
same  type  of  model. 

2.  NUMERICAL  SIMULATIONS 

2.1.  Outline  of  Model 

The  model  we  are  considering  could  be  an  array  of 
N  =  3  quantum  dots,  here  simplified  as  two-level 
subsystems.  In  order  these  to  represent  real 
subsystems,  they  need  to  be  addressable  sepa¬ 
rately.  We  assume  they  are  spectroscopically 
different,  i.e.,  they  can  be  distinguished  in 
frequency  space  and  thus  be  separately  driven  by 
a  classical  external  electromagnetic  field  (coupling 
strength  gM,  detuning  <5M).  Their  mutual  coupling, 
C£\  is  of  the  Ising-spin  type.  This  coupling 
modifies  the  energy  level  spacing  of  one  system 
depending  on  the  state  of  its  neighbors.  The 
eigenstates  of  this  interacting  network  are  still 
product  states,  though.  Local  coherence  is  intro¬ 
duced  by  the  external  driving  fields,  which  is  then 
transformed  into  entanglement  by  means  of  the 
interaction.  In  this  way  there  is  an  entanglement 
source  that  can  be  switched  on  and  off  [3]. 

2.2.  Quantum  Trajectories 

In  the  first  two  examples  all  three  subsystems 
“play  the  same  role”,  i.e.,  all  the  parameters  g,  8, 
CF  are  independent  of  fi;  they  are  chosen  such  that 
only  the  transitions  from  the  ground  state  1 1 11)  to 
|1 12),  1 121)  and  |211)  are  in  resonance.  In  addition 
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there  are  3  local  damping  channels,  again  all  with 
the  same  damping  rate  W.  The  dynamics  is 
described  by  the  appropriate  master  equation. 

As  we  are  interested  in  the  detailed  simulation  of 
individual  networks,  the  theoretical  approach  of 
choice  is  stochastic  unraveling  (see  [2]  and  ref¬ 
erences  therein):  In  this  method  the  master 
equation  is  interpreted  as  a  rule  for  a  modified 
non-unitary  continuous  evolution  of  a  pure  state, 
interrupted  by  stochastic  jumps  to  another  pure 
state.  Incoherent  driving  is  described  as  a  selective 
rate-process  (transition  rates  (gM)27r/2)  implying 
ft1”  =  /3123  =  0  at  all  times  (“classical  case”). 
While  the  latter  (Fig.  la)  is  confined  to  the  definite 
alternatives  of  being  in  the  lower  or  upper  state 
(“telegraph  signal”),  the  trajectories  in  the  non- 
classical  domain  (Fig.  lb)  are  strongly  correlated 
(they  practically  coincide  here)  and  show  a  quite 
different  behavior:  Between  the  jumps  there  is  a 
continuous  motion,  as  coherence  and  entangle¬ 
ment  allow  to  explore  additional  regions  of  state 
space:  Figure  2  shows  the  time-averaged  entangle¬ 
ment  measures  as  they  depend  on  g/W. 


timet 

FIGURE  1  Quantum  trajectories  of  a  driven  and  damped 
N  —  3 -network  (g/W=  2.5):  (a)  classical  limit  (incoherent 
pumping  rate  =  g  7r/2),  (b)  non-classical  case  (coherent  driv¬ 
ing)- 


FIGURE  2  Averaged  coherence-,  a,  and  entanglement- 
measures,  pT,  /?123  as  a  function  of  the  coherent  driving  field 
strength,  g ,  over  the  damping  rate,  W. 

2.3.  Quantum  Noise 

The  quantum  trajectories  as  such  do  not  constitute 
a  measurement,  let  alone  a  complete  one.  How¬ 
ever,  assuming  the  damping  of  each  subsystem  to 
be  due  to  radiative  decay  only,  the  trajectories  of 
the  above  model  directly  connect  to  specific 
luminescence  signals:  Each  photon  detection  event 
in  one  of  the  distinguishable  frequency  channels 
(photon  counts  per  sampling  time  T)  is  related 
to  a  jump  in  the  trajectory.  The  simulated 
detection  traces  may  then  be  statistically  analyzed 
in  terms  of  the  counting  covariances  [7]. 

N9*'”  =  («**  -  W){n1/  -W7),..  (5) 

As  shown  in  Figure  3,  the  noise  properties,  indeed, 
reflect  the  fact  that  the  network  state  is  (on 
average)  non-classical  and  follows  non-classical 
trajectories.  The  result  for  incoherent  driving  is 


FIGURE  3  Photon-counting  covariance  of  the  luminescence 
signals:  Shown  is  7?123  =  AT123/(AnA22A33)1/2  with  -  as 
defined  in  Eq.  (5).  The  sampling  time  is  W~\ 
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included  for  comparison  (broken  line);  there  is 
virtually  no  correlation  JV123  in  this  case. 

2.4.  Quantum  Parallels™ 

Contrary  to  the  two  preceding  examples  we  turn 
now  to  a  closed  system,  with  the  sub-systems 
playing  different  roles:  These  roles  are  implemen¬ 
ted  by  means  of  pulsed  changes  of  the  parameters 
controlling  the  Hamilton  model:  Well-known 
examples  are  7r-pulses  to  invert  the  occupation  of 
2-level  systems.  Interacting  networks  of  the  type 
studied  here  allow  to  implement  conditioned  7r- 
pulses,  i.e.,  the  action  of  the  light  pulse  on 
subsystem  fx  =  1  is  a  7r-pulse  only,  if  the  neighbor 
/x  =  2  is  in  state  1 1),  say;  no  action  otherwise  [8].  A 
continuously  driven  system  2  in  contact  at  time  t\ 
with  such  a  “memory-gate”  1  (pulse  duration  is 
assumed  negligible),  subsequently  at  t2  in  contact 
with  a  gate  3  will  generate  entanglement  as 
discussed  before.  However,  the  “logic”  of  these 
gate  operations  make  system  2  emulate  an 
ensemble  that  was  actually  measured  at  those 


FIGURE  4  Single  closed  network:  Subsystem-Bloch-vectors 
(in  (A2),  (A3)-plane)  at  3  instatnts  of  time.  Subsystem  p  =  2  is 
continuously  driven  (rotates  counter-clockwise)  and  is  brought 
into  contact  with  a  “gate”-subsystem  p  —  1  at  time  and  with 
a  “gate”  p  =  3  at  time  t2  (broken  lines:  before  gate  operation). 
Each  gate  is  seen  to  “project”  subsystem  2  on  the  (A3) -axis,  as 
in  an  ensemble  measurement.  Note  that  a  single  system 
measurement  would  result  in  either  up  or  down. 


two  times  (Fig.  4):  System  2  represents  in  parallel 
all  potential  trajectories  going  through  the  two 
measurement  steps!  This  quantum  parallelism 
exploits  entanglement  in  a  surprising  way.  So- 
called  quantum-algorithms  would  make  sure  that 
final  measurements,  which  invariably  pick  an 
actual  trajectory  and  thus  deviate  from  ensemble 
behavior,  indicate  desired  problem  solutions  [9]. 
Their  reliance  on  large-A-quantum  networks 
makes  them  a  challenging  target  indeed  for  future 
developments. 
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A  Generalized  Monte  Carlo  Approach  for  the  Analysis  of 
Quantum-Transport  Phenomena  in  Mesoscopic  Systems: 
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A  theoretical  investigation  of  quantum-transport  phenomena  in  mesoscopic  systems  is 
presented.  In  particular,  a  generalization  to  “open  systems”  of  the  well-known 
Semiconductor  Bloch  equations  is  proposed.  Compared  to  the  conventional  Bloch 
theory,  the  presence  of  spatial  boundary  conditions  manifest  itself  through  self-energy 
corrections  and  additional  source  terms  in  the  kinetic  equations,  which  are  solved  by 
means  of  a  generalized  Monte  Carlo  simulation. The  proposed  numerical  approach  is 
applied  to  the  study  of  the  scattering-induced  suppression  of  Bloch  oscillations  in 
semiconductor  superlattices  as  well  as  to  the  analysis  of  quantum-transport  phenomena 
in  double-barrier  structures. 


Keywords:  Quantum,  transport,  simulation,  mesoscopic,  coherence,  dissipation 


1.  INTRODUCTION 

The  Monte  Carlo  (MC)  method,  which  has  been 
applied  for  more  than  25  years  for  calculation  of 
semiclassical  charge  transport  in  semiconductors, 
is  the  most  powerful  numerical  tool  for  micro¬ 
electronics  device  simulation  [1].  However,  it  is 
well  known  that  present-day  technology  pushes 

device  dimensions  toward  limits  where  the  tradi¬ 
tional  semiclassical  transport  theory  can  no  longer 
be  applied  and  a  more  rigorous  quantum  transport 
theory  is  required  [2], 


In  this  paper,  a  generalized  MC  approach  for 
the  analysis  of  hot-carrier  transport  and  relaxation 
phenomena  in  quantum  devices  is  proposed.  The 
method  is  based  on  a  generalized  MC  solution  of 
the  set  of  kinetic  equations  governing  the  time 
evolution  of  the  single-particle  density  matrix.  Our 
approach  can  be  regarded  as  an  extension  to  open 
systems  of  the  generalized  MC  method  recently 
proposed  for  the  analysis  of  the  coupled  coherent 
and  incoherent  carrier  dynamics  in  photoexcited 
semiconductors  [3].  Compared  to  more  academic 
quantum-kinetic  approaches  [4]  -  whose  applica- 
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tion  is  often  limited  to  highly  simplified  physical 
models  and  conditions-,  the  proposed  simulation 
scheme  can  be  applied  to  realistic  cases,  allowing 
on  the  one  hand  a  proper  description  of  quantum- 
interference  phenomena  induced  by  the  potential 
profile  and  on  the  other  hand  maintaining  all  the 
well  known  advantages  of  the  conventional  MC 
method. 

As  a  first  application,  we  will  discuss  the 
scattering-induced  suppression  of  Bloch  oscilla¬ 
tions  in  semiconductor  superlattices.  As  a  second 
example,  we  will  study  the  strong  interplay 
between  coherence  and  relaxation  within  a  dou¬ 
ble-barrier  structure. 


2.  THEORETICAL  APPROACH 

In  order  to  properly  describe  carrier-transport 
phenomena  in  mesoscopic  structures,  let  us  con¬ 
sider  a  generic  electron-phonon  system,  whose 
Hamiltonian  can  be  schematically  written  as: 


The  basic  quantity  in  our  theoretical  approach  is 
the  single-particle  density  matrix 

Pap  =  i&p&a)  •  (3) 

Its  diagonal  elements  correspond  to  the  usual 
distribution  functions  of  the  semiclassical  Boltz¬ 
mann  theory  (fa  =  paa)  while  the  off-diagonal 
terms  (a  ^  ft)  describe  the  degree  of  quantum- 
mechanical  phase  coherence  between  states  a  and 
/?.  Starting  from  the  Heisenberg  equations  of 
motion  for  the  operators  aa  it  is  possible  to  derive 
a  set  of  equations  of  motion  for  the  density-matrix 
elements  pap,  called  semiconductor  Bloch  equa¬ 
tions  (SBE)  [5],  whose  general  structure  is  given 
by: 

d  d  d 

dtpaP  ~  dtpa0  ffo  +  dtpaP  H;  (4) 

The  time  evolution  induced  by  the  single-particle 
Hamiltonian  H0  can  be  evaluated  exactly  yielding: 


H=  (Hc  +  Hp)  +  (. Hcc  +  Hep)  =H0  +  H (1) 


Ho 


ih 


^  Pot' ft f  —  ^aj3Pa/3‘ 

a'(3' 


(5) 


The  single-particle  term  H0  is  the  sum  of  the 
noninteracting  carrier  and  phonon  Hamiltonians 
Hc  and  Hp  while  the  many-body  contribution  H' 
includes  carrier-carrier  as  well  as  carrier-phonon 
interactions.  More  specifically,  the  single-particle 
Hamiltonian  Hc  describes  the  noninteracting 
carrier  system  within  the  potential  profile  of  our 
mesoscopic  structure  (including  possible  external 
fields).  By  denoting  with  <j>a  (r)— (r|a)  the  wave- 
function  of  the  single-particle  state  a  and  with  sa 
the  corresponding  energy,  the  noninteracting- 
carrier  Hamiltonian  reads: 

Hc  =  ^  ^  Zgolx&a-  (2) 

a 

Here,  the  usual  second-quantization  picture  in 
terms  of  creation  (a^)  and  destruction  (aa) 
operators  has  been  employed. 


with  u)ap  =  (ea  —  ep)/ti.  On  the  contrary,  the  time 
evolution  due  to  the  many-body  Hamiltonian  Hf 
involves  phonon-assisted  as  well  as  higher-order 
density-matrices;  Thus,  approximations  are 
needed  in  order  to  “close”  our  set  of  equations 
of  motion  (with  respect  to  our  kinetic  variables). 
In  particular,  the  “mean-field”  approximation 
together  with  the  Markov  limit  approximation 
allow  us  to  derive  a  set  of  closed  equations  of 
motion  which  is  still  local  in  time: 

-jPa/3  =]Cr'a/3  a,f},pa'l)>-  (6) 

al  H'  a'0‘ 

The  matrix  T^a/^  is,  in  general,  a  complex 
quantity:  its  real  part  describes  a  sort  of  generali¬ 
zed  scattering  rate  while  its  imaginary  part  is 
related  to  energy-renormalization  phenomena. 
Recently,  a  full  quantum-mechanical  approach 
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has  been  proposed  [6],  which  overcomes  the  usual 
Markov  limit  in  describing  the  carrier-phonon 
interaction.  However,  due  to  the  huge  amount  of 
CPU  time  required,  its  applicability  is  still  limited 
to  short  time-scales  and  extremely  simplified 
situations. 

The  analysis  presented  so  far  is  typical  of  a  so- 
called  “closed”  system,  i.e.,  a  physical  system 
defined  over  the  whole  coordinate  space.  However, 
this  is  not  the  case  of  interest  for  the  analysis  of 
quantum-transport  phenomena  in  mesoscopic 
devices,  where  the  properties  of  the  carrier 
subsystem  are  strongly  influenced  by  spatial 
boundary  conditions.  In  order  to  better  under¬ 
stand  this  crucial  point,  let  us  reconsider  our 
theoretical  scheme  in  terms  of  real-space  coordi¬ 
nates. The  proper  quantum-mechanical  description 
is  then  based  on  the  so-called  Wigner  function  [7], 
given  by  the  following  Weyl-Wigner  transforma¬ 
tion  of  the  single-particle  density  matrix: 

fW{r !  k)  —  ^2  papUapir,  k)  (7) 

aft 

with 


By  applying  the  above  Weyl-Wigner  transforma¬ 
tion  to  the  non-interacting  SBE  (5),  we  obtain: 

jfW(r,k,t)  I  =  f  dr'dk'Lo(r,k;r',k')x 

i  Ho  j  (y) 

xfW(r',k',t) 

where 


For  the  case  of  a  closed  system,  the  Wigner 
function  fw  is  defined  for  any  value  of  the  real- 
space  coordinate  r  and  its  time  evolution  is  fully 
determined  by  its  initial  condition. 

On  the  contrary,  for  the  case  of  an  open  system, 
fw  is  defined  only  within  a  given  region  of 
interest  and  its  time  evolution  is  determined  by 
the  initial  condition  inside  such  region  as 
well  as  by  its  values  on  the  boundary  rb  of  the 
domain  f 1  at  any  time  tf  >  f0.  More  specifically,  by 
applying  the  Green’s  function  theory  [8]  to  Eq.  (9) 
we  have: 

fw(r,k\  t)  =  J  dr'  J  dk'Go(r,k;rf  ,k!\t  -  to)x 
x/V,*Vo)+ 

-i  I drb  I I  *"~Ox 

^  fT{n,k\t,)v{kt)1  (11) 

where 

Go(r,k;r',k';T )  =  Y^u«f){r,k)e~™aeTKt3(r'k') 

afi 

(12) 

is  the  single-particle  evolution  operator,  while  v(k) 
is  the  component  of  the  carrier  group  velocity 
normal  to  the  boundary  surface.  We  clearly 
see  that  the  value  of  fw  is  obtained  from  the 
single-particle  propagation  of  the  initial  condi¬ 
tion  inside  the  domain  ft  plus  the  propaga¬ 
tion  of  the  boundary  values  fY  from  the  points  of 
the  surface  at  any  time  tf  to  the  point  r,  k  of 
interest. 

Let  us  now  come  back  to  the  density-matrix 
formulation  by  introducing  the  inverse  Weyl- 
Wigner  transform: 


Lo{r,k-,r',k')  =  -i  ^  uap(r,  k)uagu*a0(r' ,  k ') 

a{3 

(10) 

is  the  single-particle  Liouville  operator  in  the  r,  k 
representation. 


Pap  =  Jadr  J  dku*ap(r,  k)  fw(r,  k).  (13) 

The  above  density  matrix  p  provides  a  description 
of  our  open  system  equivalent  to  the  Wigner- 
function  one.  By  applying  the  above  transforma- 
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tion  to  Eq.  (11)  and  then  performing  its  time 
derivative,  we  finally  obtain: 

=  7fc  £gp,OL'p'Pa'&i  +  Safi,  (14) 
Ho  ina'p' 

where 

£a0,a'0'  =  {U£U-']apal0,  (15) 

is  the  single-particle  self-energy  tensor  in  Eq.  (5) 
“dressed”  by  the  transformation 

Uap}a'pf  =  J^dr  J  dku*a0(r,k)ua,B’{r,  k),  (16) 

while 


d  _ 
JtPa & 


Sa0  =  'Y^Ua^al0'  /  drb  /  d ku*a,p, (rb, k)v(k) x 
a'{3r  J  J 

xf^(rb,k) 

(17) 

is  a  source  term  induced  by  our  spatial  boundary 
conditions. 

Eq.  (14)  is  the  desired  generalization  to  the  case 
of  open  systems  of  the  standard  single-particle 
SBE  in  Eq.  (5).  In  addition  to  the  source  term  (17) 
(whose  explicit  form  depends  on  the  particular 
values  of/f  on  the  surface  of  our  domain),  the 
presence  of  spatial  boundary  conditions  induces 
modifications  on  the  self  energy  £  of  the  system  via 
the  transformation  U  in  Eq.  (16). 


3.  SOME  SIMULATED  EXPERIMENTS 

The  theoretical  approach  presented  so  far  is  the 
starting  point  of  our  MC  simulation.  The  general¬ 
ized  SBE  (4)  are  solved  by  means  of  the  same 
MC  simulation  scheme  described  in  [3].  The 
method  is  based  on  a  time-step  separation  between 
coherent  and  incoherent  dynamics  (see  Eq.  (4)). 
The  former  accounts  in  a  rigorous  way  for  all 
quantum  phenomena  induced  by  the  potential 


profile  of  the  device  as  well  as  for  the  proper 
boundary  conditions  (see  Eq.  (14)).  The  latter, 
described  within  the  basis  given  by  the  eigen¬ 
states  a  of  the  potential  profile  (see  Eq.  (6)),  ac¬ 
counts  for  all  the  relevant  scattering  mechanisms 
by  means  of  a  conventional  “ensemble”  MC 
simulation  [1]. 

The  above  numerical  approach  has  been  applied 
to  the  study  of  the  scattering-induced  suppression 
of  Bloch  oscillations  in  semiconductor  superlat¬ 
tices  and  to  the  analysis  of  quantum-transport 
phenomena  in  a  double-barrier  structure. 

In  the  first  case,  a  biased  GaAs/AlGaAs  multi¬ 
quantum-well  (MQW)  has  been  considered  (15 

o  ^  o 

periods  of  a  20  A  barrier  and  a  100  A  well  with  a 
barrier  height  of  0.5  eV).  The  electric  field  in  the 
MQW  region  is  about  12  kV/cm.  The  ultrafast  dy¬ 
namics  of  an  injected  wavepacket  (injection  energy 
£=50  meV)  has  been  simulated.  Figure  1  shows 
the  position  mean  value  of  the  wavepacket  as  a 
function  of  time.  For  the  scattering-free  case,  the 
expected  Bloch  oscillations  in  real  space  are  well 
reproduced.  They  originate  from  the  coherent 
motion  of  our  wavepacket  through  the  superlattice 
structure.  In  the  presence  of  scattering  mechan¬ 
isms,  such  oscillations  are  strongly  damped.  This  is 
mainly  due  to  carrier-LO  phonon  scattering, 
whose  typical  time-scale  is  comparable  with  the 


FIGURE  1  Position  mean  value  of  an  electronic  wavepacket 
within  a  biased  semiconductor  super  lattice  as  a  function  of 
time  with  and  without  scattering  (see  text). 
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sub-picosecond  time-scale  of  our  Bloch  oscilla¬ 
tions. 

We  have  than  carried  out  the  simulation  of  an 
electronic  wavepacket  entering  (from  left)  a 
double-barrier  structure.  The  injected  carriers  are 
energetically  in  resonance  with  the  second  level  of 
the  well.  In  Figure  2  the  charge  distribution  across 
the  double-barrier  structure  is  shown  at  different 
times  with  and  without  scattering.  As  expected,  for 
the  scattering-free  case  (Fig.  2b)  we  obtain  the 
well-established  resonance  scenario:  the  wavepack¬ 
et  enters  in  resonance  with  the  second  energy  level 
of  the  structure  and  after  some  time-delay  a  part  of 
it  is  transmitted  and  a  part  is  reflected.  On  the 
contrary,  in  the  presence  of  scattering  events  (Fig. 
2a),  the  resonance  process  is  strongly  influenced  by 
a  charge  transfer  from  the  second  to  the  first 
energy  level.  This  effect  is  clearly  evidenced  by  the 
charge  distribution  inside  the  well  region,  which 
for  the  free-scattering  case  exhibits  the  typical 


second-level  symmetry  while  in  the  presence  of 
scattering  it  approaches  the  symmetry  of  the 
ground  level. 

4.  CONCLUSIONS 

In  this  paper,  we  have  presented  a  generalization  to 
open  systems  of  the  well-known  semiconductor 
Bloch  equations.  The  proposed  approach,  based  on 
the  density-matrix  formalism,  allows  a  proper 
description  of  the  strong  coupling  between  coher¬ 
ent  and  incoherent  dynamics  in  mesoscopic  sys¬ 
tems.  These  generalized  SBE  are  numerically  solved 
by  means  of  the  same  MC  simulation  scheme  used 
already  for  the  study  of  ultrafast  carrier  dynamics 
in  photoexcited  semiconductors  [3].  Our  simulated 
experiments  clearly  show  the  failure  of  any  pure 
coherent  or  incoherent  approach  in  describing  such 
quantum-transport  regime. 


With  scattering  [MC] 
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Without  scattering 


FIGURE  2  Charge  distribution  across  a  double-barrier 
structure  at  different  times  corresponding  to  an  electron 
wavepacket  injected  in  resonance  with  the  second  level  (see 
text).  The  scattering  rates  have  been  artificially  increased  by  a 
factor  10  with  respect  to  the  GaAs  values  in  order  to  emphasize 
the  relaxation  to  the  first  level. 
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We  present  two  examples  of  coherent  control  of  inter(sub)band  transitions  in  a 
semiconductor  double  well  by  coherent  light  sources.  Accounting  for  the  upper  hole 
subband  and  two  lowest  electron  subbands,  a  microscopic  theoretical  analysis  shows 
that  electron-hole  pair  generation  by  a  sub-picosecond  pump  pulse  can  be  controlled  by 
the  intensity  and  the  phase  of  a  dc  microwave  field  which  resonantly  couples  the  two 
electron  subbands.  Light  absorption  can  be  either  enhanced  or  reduced.  Secondly,  it  is 
shown  that  proper  combination  of  two  pulsed  laser  fields  allows  control  of  electron 
inter(sub)band  transitions  and  final-state  population,  i.e.,  the  formation  of  indirect 
versus  direct  excitons. 


Keywords:  Coherent  control,  semiconductor,  quantum  well,  optics,  theory 


Coherent  control  of  final-state  population  and 
chemical  reactions  has  long  been  pursued  in 
atomic  and  molecular  physics  [1-5].  More  re¬ 
cently,  improvements  in  ultrafast  spectroscopy 
have  allowed  induction  and  observation  of  coher¬ 
ent  phenomena  in  semiconductors,  in  form  of 
coherent  charge  oscillations  in  double  wells,  Bloch 
oscillations,  and  coherent  control  of  photocurrent, 
to  name  some  of  the  highlights  of  progress  in 
recent  years  [6-10].  In  this  paper  we  investigate 
coherent  control  of  inter(sub)band  transitions, 
absorption,  and  final-state  population  theoreti¬ 
cally.  Specifically  we  discuss  a  scheme  which 
allows  phase  controlled  light  absorption  and 


control  of  final-state  population  in  semiconductor 
heterostructures  by  the  interplay  of  two  coherent 
light  sources.  As  a  specific  example  we  give  results 
for  an  asymmetric  145  A/25  A/100  A  GaAs-Al- 
GaAs-GaAs  double  well  whose  electron  subband 
splitting  is  controlled  by  a  static  electric  field  and 
which  is  exposed  to  a  tunable  sub-picosecond 
pump  pulse  which  generates  electron-hole  pairs 
(excitons)  across  its  main  energy  gap.  Calculations 
are  done  within  the  framework  of  a  microscopic 
theory  in  form  of  Boltzmann-Bloch  equations 
which  account  for  the  carrier -carrier  Coulomb 
interaction  [11].  In  the  present  study,  subband 
splittings  and  time  scales  are  such  that  LO  phonon 
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scattering  may  be  neglected.  Light  pulses  are  of 
Gaussian  shape  and  the  light-matter  coupling  is 
treated  for  a  classical  light  field  including  the 
counter-rotating  part.  The  peak  of  the  pump  pulse 
arrives  at  time  zero,  relative  to  which  we  define  the 
phase  (j)  of  the  microwave  (MW)  field  £Mw(0  = 
Eo(t) cos(u>Mwt  +  <f>).  A  comparison  to  the  rotat- 
ing-wave  approximation  (RWA),  which  has  been 
used  in  an  earlier  study,  is  made  [12]. 

When  a  coherent  dc  MW  field  resonantly 
couples  the  two  electron  subbands,  the  system 
undergoes  Rabi  oscillations  between  its  uncoupled 
eigenstates  when  originally  prepared  in  one  of  the 
latter.  In  the  situation  depicted  in  Figure  1,  the 
dipole  matrix  element  between  hole  |1)  and  upper 
electron  subband  |2)  is  much  stronger  than  hole 
and  lower  electron  subband  |3).  Hence,  if  a  pump 
pulse  is  applied  at  resonance  with  the  direct 
exciton  maximum  absorption  is  obtained.  How¬ 
ever,  application  of  a  MW  field  reduces  the 
admixture  of  left-well  eigenfunction  |  W)  in  |2) 
and  transfers  it  to  |3).  Hence,  the  MW  field 
reduces  absorption  at  the  direct  exciton  peak  [13]. 
Conversely,  if  the  pump  pulse  is  tuned  near  the 
indirect  exciton,  application  of  a  MW  field  enhances 


1 


FIGURE  1  Illustration  of  a  biased  asymmetric  GaAlAs- 
GaAs  double  well.  |1),  |2)  and  |3)  denote  double-well 
eigenstates;  \W)  and  |N)  are  the  lowest  wide-well  and 
narrow-well  eigenstates,  respectively.  The  shape  of  wave 
functions  is  indicated  by  thin  solid  lines  on  top  of  the 
corresponding  energy  levels. 


absorption.  The  phase  of  the  coherent  dc  MW  field 
enters  the  coupling  between  the  electron  subbands 
and  hence  the  complex  electron  interband  polariza¬ 
tion.  When  the  pump  pulse  duration  is  shorter  than 
the  inverse  of  the  MW-induced  Rabi  frequency,  the 
phase  influences  the  absorption  process  from  a  third 
level,  here,  the  top  hole  subband  |1).  This  is 
demonstrated  in  Figure  2  for  the  case  where  a  80 
fs  pump  pulse  of  Gaussian  profile  is  tuned  near 
resonance  with  the  indirect  excitons  of  the  DW  with 
a  subband  splitting  of  20  meV.  The  MW  intensity  is 
about  2  MWcm-2,  corresponding  to  a  Rabi  period 
near  100  fs.  Clearly,  the  presence  of  the  MW  field 
increases  absorption  by  about  15  percent  for  phase 
7r/2.  This  effect  is  more  pronounced  at  the  direct 
exciton  peak  [13]  The  RWA  underestimates  the 
importance  of  second  harmonics  in  the  carrier 
dynamics  induced  by  the  MW  field.  Therefore, 
shorter  pump  pulses  than  predicted  by  within  the 
RWA  may  be  required  to  display  the  phase 
dependence  of  absorption. 

For  the  present  three-level  system  and  within  the 
RWA  for  transitions  between  conduction  and 
valence  bands ,  the  pump  pulse  photon  Boltzmann 
equation  in  the  presence  of  coherence  in  the  carrier 


Number  of  e-h  Pairs  versus  Time 


(subbands  20  mev  off  resonance,  laser  20meV  below  direct  exciton  line) 


Time  (fs) 


FIGURE  2  Density  of  photo-generated  electron-hole  pairs  as 
a  function  of  time  and  MW  phase  for  a  pump  pulse  photon 
energy  which  is  20  meV  below  the  direct  exciton  peak. 
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system  is  of  the  structure  [11]. 

dt  2,3 

-fn(k,t)(l-faa'(k,t))N(q,  0}, 

E  MHAq)M*Ha(q)6E 

"  k, a, a' =  2,3 

{faa'(k,  t)  -fu(k,  /)},  for  iV(^r,  f)»l, 

where  faot(k,  t)  are  carrier  distribution  functions, 
for  a  =  a',  and  interband  polarizations,  for  a  ^  a'. 
iV(#,  0  is  the  photon  occupation  number  and 
Maof(q)  is  the  matrix  element  for  coupling  sub¬ 
band  a  to  subband  af  via  photon  q.  8E  is  the 
appropriate  energy-conserving  delta  function.  This 
equation  shows  that  electron  interband  polariza¬ 
tion  influences  photon  absorption.  Figure  3,  in 
turn,  shows  how  the  phase  of  the  MW  field 
determines  the  sign  of  the  real  part  of  the  electron 
interband  polarization  during  the  presence  of  the 
pump  pulse  to  allow  coherent  control  of  the 
absorption  process. 

A  possible  scheme  to  control  the  final-state 
population  in  semiconductors  resembles  previous 
work  on  atoms  for  which  adiabatic  switching  has 
been  demonstrated  to  allow  transitions  between 
molecular  levels  which  are  dipole-forbidden  by 


FIGURE  3  Real  part  of  ]  W)  -  |JV)  interband  polarization  as 
a  function  of  time  and  MW  phase. 


means  of  optical  coupling  to  a  third  level  [4,  5]. 
In  this  scheme,  which  is  sketched  in  Figure  1,  two 
temporally  and  spatially  overlapping  light  pulses 
couple  three  levels  of  the  system.  The  initial  state 
of  the  system  is  |1),  the  desired  final  state  is  |3), 
and  the  intermediate  state  is  |2).  If  the  light  pulse 
coupling  |2)  and  |3),  with  amplitude  a{t )  and 
duration  7mw>  arrives  and  ends  before  arrival  and, 
respectively,  end  of  the  pulse  coupling  |1)  and  |2), 
with  amplitude  b(i)  and  duration  Tp ,  and  the  pulse 
amplitudes  change  sufficiently  slowly  (relative  to 
the  characteristic  frequencies  of  the  driven  three- 
level  system),  stimulated  Raman  adiabatic  passage 
(STIRAP)  occurs  and  the  system  undergoes  an 
adiabatic  transition  from  level  1 1 )  to  level  |3).  [4] 
The  condition  for  an  adiabatic  transition  for 
T  =  Tp—  7mw  is 


T  • 


where  6  is  the  light  field  detuning. 

However,  in  semiconductors  resonant  coupling 
between  subbands  calls  for  MW  pulses,  which  may 
be  difficult  to  generate  at  sufficiently  high  intensity. 
Moreover,  due  to  characteristic  decoherencing 
times  for  (free)  carriers  in  semiconductors  passage 
times  can  not  be  much  longer  than  a  picosecond. 
Therefore,  the  STIRAP  process  may  be  imprac¬ 
tical,  if  not  impossible,  to  be  adopted  to  semi¬ 
conductor  nanostructures.  Hence,  we  have 
investigated  sub-picosecond  nonadiabiatic  transfer 
based  on  the  three-subband  scheme  in  Figure  1, 
where  subband  1 1)  is  the  top  hole  subband  of  a 
heterostructure,  |2)  and  |3)  are  the  two  lowest 
electron  subbands  separated  by  25  meV,  in  the 
present  case.  A  lOOfs  subpicosecond  pump  pulse  is 
used  to  generate  electron-hole  pairs  (direct  ex- 
citons)  associated  with  levels  |1)  and  |2).  Its 
duration  must  be  shorter  than  the  inverse  Rabi 
frequency  between  subband  |2)  and  |3)  which  is 
established  by  a  concurrent  MW  pulse.  For 
excitation  densities  of  about  1010  carriers  per  cm2 
this  ensures  MW-induced  charge  oscillations 
between  subband  |2)  and  |3).  Intensity  and 
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duration  of  the  MW  field  are  adjusted  so  that  the 
latter  permits  one  half  of  a  Rabi  oscillation 
between  |2)  and  |3),  such  that  the  electrons  get 
trapped  in  state  |3)  («  \N))  after  a  single  tunneling 
process.  Here,  the  MW  pulse  resonantly  couples 
the  two  electron  subbands  (8  =  0)  and  has  a 
duration  of  TMW  =  320  fs,  corresponding  to  about 
two  MW  periods,  and  a  peak  intensity  of  about  1 
MW  cm-2.  Zero  time  delay  between  the  two  pulses 
and  phase  tt  of  the  MW  field  relative  to  the  peak  of 
the  pump  pulse  were  found  to  give  best  results. 

In  Figure  4  we  show  the  number  of  electrons  in 
the  left  well  versus  the  number  of  electrons  in  the 
right  well,  and  the  total  number  of  carriers  (holes). 
Calculations  including  the  counter-rotating  field 
contributions,  thick  lines,  are  compared  to  those 
within  the  RWA,  thin  lines.  It  is  clearly  evident 
that  the  presence  of  the  MW  field  reverses  the 
tendency  for  direct  (solid  lines)  versus  indirect 
exciton  (dot-dashed  lines)  formation.  Without 
MW  field  and  owing  to  the  shape  of  wave 
functions,  predominantly  direct  excitons  are 
formed  initially.  Figure  4  also  shows  that  the 
RWA  gives  almost  quantitatively  correct  results 
for  the  final  state  population  in  the  present  case. 
However,  it  predicts  simple  (damped)  harmonic 
charge  oscillations,  whereas  the  full  calculation 
shows  a  more  complicated  dynamics. 


FIGURE  4  Carrier  densities  in  the  double  well  versus  time. 
Solid  lines:  wide  well  (direct  excitons);  dot-dashed  lines:  narrow 
well  (indirect  excitons);  dashed  lines:  holes  (total  number  of 
excitons);  thick  lines:  with  microwave  field  (phase  7 r)  regular 
lines:  no  MW  field;  thin  lines:  MW  field  in  RWA  approxima¬ 
tion. 


In  summary,  we  have  given  theoretical  results 
which  indicate  that  coherent  control  of  intersub¬ 
band  transitions  on  a  subpicosecond  time  scale  is 
possible  by  means  of  coherent  light  sources.  In 
particular,  we  have  investigated  coherent  control 
of  light  absorption  and  final-state  population  in 
semiconductor  double  wells.  It  should  be  pointed 
out  that  the  latter  should  also  be  achievable  by 
interference  between  single  and  triple  photon 
absorption,  in  analogy  to  experiments  on  mole¬ 
cules  [3],  and  coherent  current  control  via  inter¬ 
ference  between  single  and  two-photon  absorption 
[8].  Details  of  these  findings  will  be  published 
elsewhere  [12].  The  rotating-wave  approximation, 
which  allows  analytic  solution  of  the  coupled 
three-level  system,  is  found  to  give  nearly  quanti¬ 
tatively  correct  results  for  the  non-adiabatic  final- 
state-control  processes.  However,  for  the  coherent 
control  process  of  absorption  discussed  here,  it 
merely  gives  qualitatively  correct  results.  Specifi¬ 
cally,  it  fails  to  give  an  accurate  account  of  charge 
oscillations  induced  by  the  MW  field  and,  conse¬ 
quently,  the  correct  value  for  the  phase  of  optimal 
coupling. 
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Quantum  transport  in  multiple-barrier  systems  exposed  to  intense  laser  fields  is 
investigated  by  employing  an  analogy  with  the  Fabry-Perot  interferometer  of  classical 
optics.  This  analogy  is  exact  if  no  further  approximations  are  introduced.  The  Fabry- 
Perot  approach  is  shown  to  be  quite  robust:  Approximating  the  barriers  by  5-functions 
with  equivalent  dc  transmission  and  reflection  amplitudes  still  yields  excellent  agreement 
with  “exact”  orthodox  transfer-matrix  calculations.  The  advantage  of  the  former  is  its 
much  reduced  computational  demands  as  well  as  ease  of  use.  73.40.Gk,  73.50.Pz, 
72.40.  +  w,  42.50.Hz. 
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INTRODUCTION 

It  is  commonly  agreed  that  ultimately,  with  the 
ever  increasing  miniaturization  of  devices,  one  will 
have  to  resort  to  quantum-transport  theories  for  a 
proper  modeling  of  device  characteristics.  On  the 
other  hand,  it  is  equally  clear  that  today’s 
quantum-transport  theories  are  less  mature  than 
semi-classical  approaches  [1]  and  demand  huge 
computer  resources  in  order  to  give  only  reason¬ 
ably  accurate  results.  Moreover,  most  quantum- 
transport  theories  currently  available  are  restricted 
to  steady-state  operations  or  a  small-signal  analy¬ 
sis.  Although  strict  theoretical  treatments  of  non¬ 
equilibrium  Green’s  functions  with  arbitrary  time- 
dependence  and  initial  many-particle  correlations 


do  exist  [2],  they  are  generally  too  complicated  to 
be  easily  implemented.  Nevertheless,  serious  efforts 
are  being  made  in  this  direction  (See,  for  instance, 
the  NEMO  project  described  in  [3]). 

The  issue  we  want  to  address  in  this  paper  is  that 
of  quantum  transport  in  the  presence  of  intense 
external  time-dependent  driving  fields  where  the 
usual  small-signal  analysis  fails.  The  subject  of  (in) 
coherent  quantum  transport  in  strongly  driven 
heterostructures  has  recently  gained  much  mo¬ 
mentum  with  the  advent  of  free-electron  lasers, 
which  are  capable  of  providing  intense  radiation 
[4].  Here  we  report  on  a  new  approach  to  the 
notoriously  complex  problem  of  a  theoretical 
description  of  quantum  transport  in  intense  laser 
fields  that  is  based  on  an  analogy  with  the  Fabry- 
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Perot  interferometer  in  optics.  In  previous  studies, 
a  transfer-matrix  method  had  been  employed  to 
calculate  the  reflection  and  transmission  ampli¬ 
tudes  of  scattering  states  in  driven  heterostructures 
[5-10].  Due  to  the  large  size  of  the  transfer  matrix 
even  for  a  one-dimensional  system  (the  size  being 
proportional  to  the  highest  order  of  photon 
processes  one  wishes  to  take  into  account),  these 
calculations  demand  large  computer  resources 
when  aimed  at  studying  the  regime  of  strong 
driving.  In  contrast,  the  Fabry-Perot  approach  is 
rather  modest  in  its  computational  demands  and 
moreover,  much  faster  for  not  too  complex 
heterostructures.  Our  aim  here  is  not  to  develop 
a  fully-fledged  theory  for  realistic  device  simula¬ 
tion  that  includes  all  possible  scattering  mechan¬ 
isms,  but  rather  to  put  forward  a  useful  tool  for 
gaining  a  good  understanding  of  the  basic  physics 
involved  in  the  quantum-transport  in  strongly 
driven  systems.  To  this  end  we  seek  to  solve  the 
time-dependent  single-particle  Schrodinger  equa¬ 
tion  but  refrain  from  making  any  assumptions 
whatsoever  about  the  strength  of  the  driving  field. 
The  additional  degree  of  freedom  introduced  by  an 
explicit  time  dependence  of  the  Hamiltonian  opens 
up  the  possibility  for  new  effects  to  be  explored, 
only  a  few  of  which  have  already  been  discovered 
and  studied  in  some  detail,  such  as  the  “dynamic 
localization”  of  minibands  in  driven  superlattices 
[11,12]  and  the  “quenching  of  tunneling”  in  driven 
tunneling  diodes  [6,  8]. 

FABRY-PEROT  THEORY 

The  textbook  approach  to  calculating  the  prob¬ 
ability  for  an  electron  to  tunnel  through  a  multi¬ 
ple-barrier  structure  is  to  solve  for  the  corres¬ 
ponding  scattering  state  with  the  proper  boundary 
conditions,  which  consist  of  a  traveling  incoming 
wave  and  reflected  as  well  as  transmitted  outgoing 
waves.  A  convenient  way  of  doing  this  is  to  set  up 
and  solve  a  transfer-matrix  problem  [6,  8].  There 
is,  however,  an  alternative  approach  that  takes 


advantage  of  an  analogy  with  optics  [13]:  Rather 
than  solving  a  large  system  of  simultaneous 
equations,  one  can  track  the  electron  in  a  manner 
resembling  ray  tracing:  An  electron  incident  on  the 
first  barrier  has  some  probability  to  tunnel.  If  it 
does  tunnel,  it  will  travel  to  the  next  barrier,  where 
it  may  be  reflected  or  again  tunnel.  However,  the 
electron  reflected  at  the  second  barrier  is  not  lost, 
it  still  has  a  chance  to  tunnel  through  this  barrier. 
All  it  needs  to  do  is  to  travel  back  to  the  first 
barrier,  get  reflected  there  and  come  back  to  the 
second  barrier  to  have  a  second  try.  It  is  clear  that 
in  a  multiple-barrier  structure  the  number  of 
possible  paths  an  electron  can  take  and  still  finally 
arrive  at  the  far  side  is  enormous.  However,  all 
these  paths  can  be  summed  up  in  a  few  geometric 
series  as  we  will  demonstrate  in  the  following. 

Before  proceeding  any  further,  however,  let  us 
introduce  the  notation  pertinent  to  tunneling  in 
driven  systems.  The  main  difference  compared  to 
ordinary  tunneling  in  static  structures  is  that  in  a 
harmonically  driven  system  the  electron  can  pick 
up  or  lose  integer  units  of  photon  quanta  htu  from 
the  driving  field.  Hence,  the  electron  is  not 
restricted  to  its  original  energy  E,  but  rather  has 
a  finite  probability  of  being  found  in  one  of  the 
photonic  sidebands  or  “channels”  at  energies 
E  +  nhuj.  As  a  consequence,  partial  transmission 
probabilities  will  need  to  be  calculated  for  all 
channels  individually.  Obviously,  the  number  of 
sidebands  to  take  into  account  for  this  will  depend 
on  the  strength  of  the  driving  field. 

Let  us  denote  the  partial  transmission  amplitude 
of  a  single  driven  barrier  from  channel  j  (i.e.,  with 
incident  energy  EJ)  to  channel  /  as  tljj ,  where  the 
superscripts  /,  r  indicate  from  which  side  the 
electron  impinges  on  the  barrier  (see  Fig.  1).  A 
similar  notation  is  used  for  the  reflection:  rjjr. 
Furthermore,  the  “optical”  path  across  a  quantum 
well  of  width  d  carries  the  phase  factor  g/y  =  £/y 
exp  ( ik}d )  with  ki  —  yj2m{E§  +  TfaJj/h  being  the 
wave  vector  of  sideband  /. 

For  a  double-barrier  diode  in  particular,  the 
transmission  amplitudes  for  an  electron  incident, 
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1st  barrier  2nd  barrier 


FIGURE  1  Schematics  of  the  notations  used  for  the  transmission  and  reflection  amplitudes  and  the  optical  path  Q  in  the  quantum 
well  that  underlie  the  Fabry-Perot  approach. 


say,  from  the  left-hand  side  can  be  summed  up  in  a 
geometric  series,  which  in  a  shorthand  matrix 
notation  can  be  written  as 

t  =  ^'(I  +  R  +  R2  +  ■  ■  -)Qt1'1  =  t  V(I  -  R)~l Qt1’1 , 

(1) 


with  /  being  the  identity  matrix  and  the  kernel  R  — 
Qrx,r  Qr 2,1  describing  one  reflection  cycle  in  the 
quantum  well  starting  and  ending  at  the  left  wall  of 
the  right  barrier.  Note  that  while  in  the  static  case  R 
is  a  scalar,  it  is  a  matrix  of  infinite  rank  for  finite  ac 
driving.  In  a  similar  fashion,  the  reflection  ampli¬ 
tudes  at  the  double-barrier  diode  are  found  as 

r  =  ru  +  t  l'rQr 2,1  (I  -  RyxQtX'1.  (2) 

To  give  the  reader  an  impression  of  the 
complexity  to  expect  when  treating  more  than 
two  barriers,  we  also  present  the  formula  for  the 
transmission  amplitude  through  a  triple  barrier 
diode, 

t=fil 

1 

x /■ -  (74) 1  (7^)  fl>  *'C.  '2" 
l 

where  the  reciprocal  is  understood  to  denote  the 
matrix  inverse. 


62^ 


1 


(/-*0 


Q\t 


i,/ 


(3) 


As  usual,  the  transmission  probability  is  ob¬ 
tained  from  the  transmission  amplitude  via  7}/  = 
\kj/ki\mi/mj\tji\2.  The  main  advantage  of  a  formula¬ 
tion  along  these  lines  is  that  the  basic  building 
blocks  of  reflection  and  transmission  amplitudes 
through  single  barriers  may  be  determined  by  other 
means,  either  approximately  or  even  exactly.  For 
instance,  assuming  the  photon  energy  to  be  small 
compared  to  the  energy  of  the  incident  electron,  the 
transmission  and  reflection  amplitudes  through  a 
single  ^-function  barrier,  driven  by  a  uniform  ac 
field  Fcos  utf,  are  approximately  given  by  [13] 

in  =tfci  Sn  o, 

rn  =rdc(-i)nJn(2d),  (4) 

where  a  =  eFko/mu)2,  with  k0  being  the  wave  vector 
of  the  incident  electron,  Jn  the  n  th  Bessel  function 
of  the  first  kind  and  tdc  and  the  rdc  the  transmission 
and  reflection  amplitudes,  respectively,  for  the 
static  barrier.  In  this  approximation,  the  kernel  R 
of  Eqs.  (1)  and  (2)  describing  the  multiple  reflec¬ 
tions  in  the  quantum  well  is  evaluated  as 

Rji  =rdJr  dJ  XI  exp(%rf)^-/-V7_/_„(2a) 

n 

x  Qxp(iki+nd)(-i)nJn(2a) 

«  4r4' exp{z‘M [2  +  v(3/  +  l)/4}} 

x  Jj-i(\4a  sin(/c0<iv/4)|),  (5) 
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where  v  —  Huj/Eq  is  the  dimensionless  photon 
energy. 

RESULTS  FOR  A  DOUBLE-BARRIER  DIODE 

For  the  remainder  of  this  paper  we  shall  focus  on 
results  for  double-barrier  diodes  driven  by  a 
spatially  uniform  ac  field  F  cos  u>t.  Although  the 
Fabry-Perot  method  by  itself  is  exact,  the  sub¬ 
sequent  approximations  made  above  such  as  the 
assumption  of  5-barriers  etc.  may  affect  the  final 
result.  However,  our  findings  indicate  that  the 
agreement  with  the  standard  transfer-matrix  tech¬ 
nique  [10]  is  still  excellent.  Figure  2a  presents 
typical  results  of  the  Fabry-Perot  approach,  based 
on  Eqs.  (1)  and  (4-5).  Shown  is  the  probability  for 
an  electron  incident  at  the  energy  E0  of  the 
undriven  resonance,  to  transverse  a  symmetric 
double-barrier  structure  and  emerge  at  the  far  side 
in  sideband  n ,  i.e.,  at  energy  Eq  +  rihu>.  To  leading 
order,  the  sideband  amplitudes  depend  on  the 
dimensionless  parameter  a  =  eFk0/muj2.  For 
comparison,  we  also  present  in  Figure  2b  the 
results  of  a  calculation  using  transfer  matrices.  The 
agreement  is  very  good,  with  minor  discrepancies 
appearing  in  the  asymmetry  of  the  “mirror” 
sidebands  at  n  =  ±\n\,  which  are  due  to  the 
approximations  introduced  in  Eqs.  (4-5).  Apart 
from  being  computationally  faster  by  one  or  two 
orders  of  magnitude  compared  to  the  transfer- 
matrix  method,  a  major  virtue  of  the  Fabry-Perot 
approach  lies  in  the  fact  that  for  generating  Fig¬ 
ure  2a  we  have  only  used  the  dc  transmission  and 
reflection  amplitudes  of  a  single ,  undriven  barrier. 
This  is  a  substantial  advantage,  as  it  allows  us  to 
“plug  in”  the  transmission  and  reflection  ampli¬ 
tudes  obtained  by  using  other  approaches.  In¬ 
cidentally,  the  fact  that  the  entire  effect  of  the 
driving  ac  field  can  be  subsumed  in  the  Bessel 
function  factors  of  Eq.  (4)  is  a  rather  surprising 
result  by  itself.  And  last  but  not  least,  the  Fabry- 
Perot  approach  is  more  amenable  to  introducing  a 
phase-breaking  mechanism  by  randomizing  the 


phase  in  the  optical-path  matrix  Q  traversing  the 
quantum  well  [14]. 

The  agreement  between  the  Fabry-Perot  and  the 
transfer-matrix  method  is  just  as  good  for  other 
energies  of  the  incident  electron.  In  Figure  3  we 
assumed  all  parameters  to  be  the  same  as  in 
Figure  2,  except  for  the  electron’s  energy,  which 
we  took  to  be  equal  to  that  of  the  n  =  -1  sideband, 
i.e.,  E  =  Eq  —  Hu).  Most  noteworthy  in  this  Figure 
is  the  exact  reproduction  of  the  splitting  of  the 
n  =  0  and  n  —  2  channels  for  small  values  of  eFk0/ 
mu2,  which  cannot  be  accounted  for  in  a  simpler 
product  ansatz  proposed  earlier  [6]. 

The  pronounced  dips  in  the  transmission  prob¬ 
ability  seen  in  Figures  2  and  3  at  particular  values 
of  the  driving  field  are  a  manifestation  of  the 


eFk0/mco2 

FIGURE  2  Transmission  probability  in  the  lowest  few 
photonic  sidebands  through  a  driven  double-barrier  diode 
(see  inset  in  top  panel)  as  a  function  of  driving  strength  for  an 
electron  incident  at  energy  E0,  the  energy  of  the  lowest 
resonance  in  the  static  quantum  well:  a)  Fabry-Perot,  b) 
transfer-matrix  method. 
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FIGURE  3  Same  as  Figure  2  ,  but  for  an  electron  incident  at 
energy  E$  -  hv,  i.e.,  one  photon  quantum  below  the  quantum- 
well  resonance. 


“quenching  of  tunneling”  predicted  for  strongly 
driven  systems  [6].  Basically,  in  order  to  traverse 
the  double-barrier  structure,  an  incident  electron 
first  has  to  couple  into  the  driven  resonance  in  the 
quantum  well.  The  efficiency  for  this  to  happen  is 
proportional  to  the  overlap  of  the  electronic  wave 
function  with  that  of  the  driven  resonance.  A 
similar  overlap  integral  occurs  when  the  electron 
exits  the  quantum  well.  Evidently,  the  electron 
cannot  couple  to  the  entire  wave  function  of  the 
driven  resonance,  but  rather  interacts  only  with  its 
energy  component  or  “spectral  weight”  at  the 
energy  of  the  electron .  This  spectral  weight  turns 
out  to  be  a  strong  function  of  the  driving  field  [9] 
and  in  particular  may  have  pronounced  minima 
under  suitable  driving  conditions,  which  leave 
their  fingerprints  as  characteristic  minima  in  the 
transmission  probability. 


CONCLUSIONS 

In  conclusion,  a  Fabry-Perot  approach  has  been 
used  to  reduce  the  problem  of  coherent  transmis¬ 
sion  and  reflection  in  strongly  driven  complex 
tunneling  structures  to  that  of  single  driven 
barriers.  This  approach  gives  valuable  insight  into 
the  physics  involved  and  is  easy  to  use  at  the  same 
time.  It  is  even  possible  to  go  one  step  further  by 
replacing  the  finite-height  barriers  with  6  barriers 
having  the  same  effective  dc  transmission  and 
reflection  amplitudes.  It  then  turns  out  that  the  ac 
driving  field  affects  only  the  relative  amplitudes  of 
the  photonic  sidebands,  while  the  dc  transmission 
and  reflection  probabilities  determine  the  absolute 
magnitude  summed  over  all  sidebands  -  even  for 
finite  ac  fields. 
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Device-device  interaction  is  described  within  the  framework  of  the  quantum  Liouville 
equation.  Approximations  to  treating  this  interaction,  within  the  framework  of 
alterations  to  phase  space  boundary  conditions,  are  illustrated. 


Keywords:  Liouville  equation,  phase  space  boundary  conditions,  quantum  devices,  thermal 
deBroglie  wavelength,  device-device  interactions 


INTRODUCTION 

Classical  descriptions  of  device-device  interactions 
involve  simple  boundary  conditions,  often  through 
an  external  circuit,  or  in  some  cases  through 
electromagnetic  interactions.  Quantum  device-de- 
vice  interactions  are  richer,  as  there  are  several 
critical  lengths.  Device-device  interactions  on  a 
scale  of,  or  smaller  than,  a  thermal  deBroglie 
wavelength,  AoeBrogiie  =  yjh1  /2mkBT,  introduce 
coherence,  and  two  barriers  separated  by  distances 
of  the  order  of  AdeBrogiie  result  in  resonant 
tunneling.  Debye  length-scales  are  a  second  critical 
length.  But  the  critical  length-scale  of  interest  is  an 
inter-device  length,  Ainter-device>  which  is  the  mini¬ 
mum  inter-device  separation  before  changes  in  one 
device  affect  the  properties  of  an  adjacent  device. 


The  concept  of  an  inter-device  separation  is  more 
general  than  that  associated  with  either  the 
thermal  deBroglie  wavelength  or  the  Debye  length, 
as  its  value  arises  from  phase  space  boundaries  and 
involves  at  least  two  coordinates.  In  this  paper  we 
illustrate  the  concept  of  an  inter-device  length  for  a 
pair  of  double  barriers. 

We  first  examine  the  form  of  the  Liouville 
potential  in  the  coordinate  representation:  V(x,  x ') 
=  V(x  -  Vx '),  x  and  x'  being  phase  space 
coordinates.  Based  on  the  form  of  V(x,  x' )  we 
identify  both  non-interacting  and  interacting 
diodes.  We  then  solve  the  quantum  Liouville 
equation  for  a  surrogate  problem  where  changes 
in  the  charge  distribution  arise  from  changes  in  the 
phase  space  boundary  conditions  that,  we  argue, 
represent  the  device-device  interaction. 
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The  case  of  non-interacting  diodes  is  identified 
in  Figure  1,  where  we  display  V(x,  x ')  for  a  pair  of 
double  barrier  structures.  The  coordinate  divisions 
represent  multiples  of  an  isolated  device,  and  for  a 
double  barrier  RTD  of  length  200  nm,  the  number 
2  corresponds  to  400  nm.  In  the  following  we  will 
refer  to  two  regions:  region  1,  where  0  <  jc<  1,0 
< x  '  <  1,  and  region  3:  1  <x  <2, 1  <x  '  <2.  The 
pairs  of  double  barriers  are  centered  in  regions  1 
and  3,  and  the  absence  of  any  curvature  away  from 
the  double  barrier  Liouville  potential  indicates 
that  these  two  structures  are  not  interacting. 
Figure  2  shows  a  modification  of  the  Liouville 
potential  for  a  perturbation  centered  on  the 
diagonal  in  region  1  at  a  point  x  =  jt'  =  0.75.  The 
distortion,  which  spills  into  region  ‘3  as  shown  in 
Figure  3,  will  alter  the  density  matrix  in  region  3. 
On  the  basis  of  Figure  3  and  diagrams  similar  to  it, 
we  argue  that  introducing  variations  in  the  phase 
space  boundary  conditions  can  approximate  the 
effect  of  device-device  interaction  on  the  behavior 
of  any  given  device. 

THE  ROLE  OF  PHASE  SPACE  BOUNDARY 
CONDITIONS  AND  CONCLUSIONS 

While  solutions  to  the  quantum  Liouville  equation 
provide  the  means  for  examining  device-device 


FIGURE  1  The  Liouville  potential  for  a  pair  of  non¬ 
interacting  double  barrier  diodes  centered  in  regions  1  and  3. 


FIGURE  2  The  Liouville  potential  for  a  perturbation  in 
region  1,  that  spills  over  into  region  3.  These  structures  are 
interacting. 


FIGURE  3  A  display,  within  region  3,  of  a  section  of  the 
distorted-Liouville  potential  of  Figure  2.  Note  the  distortion  of 
the  Liouville  potential  near  the  origin  of  region  3. 


interactions  on  a  large  scale,  as  a  practical  matter 
this  approach  is  premature,  as  computational 
times  would  be  excessive.  Rather,  we  solve  the 
Liouville  equation  subject  to  boundary  conditions 
that  represent  the  effects  of  device-device  interac¬ 
tion.  In  particular  we  have  solved  the  quantum 
Liouville  equation  [1]: 

( dp(x,x')  dp(x,x') 

V  dt  dt 

n2  /  o2  o ^  \ 

“  "Si  +  n*, *>(*,*'). 
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for  a  200  nm  square  region  for  symmetric  and 
asymmetric  double  barrier  structures,  and  sym¬ 
metric  quantum  wells.  For  noninteracting  struc¬ 
tures  the  density  matrix  boundary  conditions 
along  the  line  x'  =  200nm,  was  exp  [-((x-x')  /  4 
AdeBroglie)2]-  For  calculations  representing  interact¬ 
ing  structures  the  boundary  condition  is  A(x-x') 
exp[-((x-x')/4  AdeBroglie)2]-  The  modulating  func- 
tion  was  chosen  arbitrarily.  Figure  4  displays  the 
boundary  variation  invoked  in  this  study,  and 
Figure  5  displays  the  effects  of  this  boundary 
variation  on  the  charge  density  within  the  sym¬ 
metric  RTD.  Figure  5  also  displays  the  potential 
energy.  The  calculation  with  the  smallest  charge 
within  the  barrier  occurs  for  a  modulating  func¬ 
tion  that  is  set  to  unity.  For  the  change  in  the 
modulating  function  shown  in  Figure  4,  there  is 
significant  charge  accumulating  within  the  barri¬ 
ers.  Very  similar  results  emerge  from  the  asym¬ 
metric  double  barrier  structures.  Significantly,  the 
form  of  the  perturbation  affects  the  form  of  the 


FIGURE  4  Variations  of  the  density  matrix  boundary 
condition  along  x'  =  200nm  for  a  double  barrier  structure. 


FIGURE  5  The  density  distributions  for  a  non-self  consistent 
solution  to  the  quantum  Liouville  equation  subject  to  the 
variations  of  Figure  4. 


charge  density  alterations.  Additional  calculations 
reveal  that  the  effect  of  the  boundary  is  most 
dramatic  for  perturbations  close  to  the  origin. 
Because  of  the  Gaussian  decay  of  the  boundary 
condition,  the  further  the  perturbation  is  away 
from  the  origin  the  weaker  the  effect.  The  results 
also  suggest  that  the  shorter  the  inter-device 
separation  the  greater  the  interaction.  Supplemen¬ 
tary  calculations  indicate  that  double  barrier 
diodes  with  cladding  regions  doped  to  1024/m3, 
require  a  minimum  separation  of  approximately 
180  nm  to  be  considered  as  isolated. 
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One  of  the  most  promising  applications  of  single-electronics  is  a  single-electron  memory 
chip.  Such  a  chip  would  have  orders  of  magnitude  lower  power  consumption  compared 
to  state-of-the-art  dynamic  memories,  and  would  allow  integration  densities  beyond  the 
tera  bit  chip.We  studied  various  single-electron  memory  designs.  Additionally  we  are 
proposing  a  new  memory  cell  which  we  call  the  T-memory  cell.  This  cell  can  be 
manufactured  with  state-of-the-art  lithography,  it  operates  at  room  temperature  and 
shows  a  strong  resistance  against  random  background  charge. 

Keywords:  Memory,  single-electron  tunneling,  simulation,  random  background  charge 


1.  INTRODUCTION 

Over  the  last  years  many  applications  of  the 
Coulomb  blockade  and  of  Coulomb  oscillations, 
so  called  single-electron  devices,  have  been  pro¬ 
posed.  The  most  fundamental  device  is  the  single¬ 
electron  transistor  [1]  which  was  soon  followed  by 
pumps,  turnstiles,  supersensitive  electro-meters, 
memories  and  logic  gates  (see  for  example  [2]). 
Many  of  these  devices  have  been  fabricated  as 
laboratory  prototypes  with  various  manufacturing 
processes,  and  have  been  operated  usually  at 
temperatures  at  or  below  liquid  Helium  (4.2  K). 
Some  effects  could  be  seen  and  some  prototypes 
could  be  operated  up  to  liquid  nitrogen  tempera¬ 
ture  (77  K)  [3]  and  only  very  few  functioned  at 


room  temperature  (300  K)  [4].  But  almost  all  of 
these  prototypes  suffer  from  the  extraordinary 
charge  sensitivity  of  single-electron  devices  which 
makes  them  strongly  vulnerable  to  uncontrollable 
impurities;  traps  and  charge  transport  in  other 
parts  of  the  circuit.  Additionally  all  employed 
manufacturing  processes  have  not  been  proven  to 
be  applicable  for  mass  production.  That  is,  the 
challenge  for  the  next  years  is  to  find  reliable  and 
reproducible  mass  production  methods  for  robust 
room  temperature  single-electron  devices. 

All  results  and  conclusions  are  based  on 
computer  simulations  which  have  been  performed 
with  our  single-electron  device  and  circuit  simu¬ 
lator  SIMON.  For  more  information  about 
SIMON  please  refer  to  [5, 6]. 
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2.  DIFFERENT  SINGLE-ELECTRON 
MEMORY  CELLS 

In  the  following  we  are  describing  briefly  the 
function  of  different  single-electron  memory  cells. 
Some  of  them  have  already  been  extensively 
treated  in  other  publications. 

2.1.  Flip-Flop 

One  design  possibility  is  to  mimic  conventional 
CMOS  architecture  with  single-electron  devices.  A 
static  single-electron  memory  cell  design  or  flip- 
flop,  based  on  this  approach  was  proposed  by  A. 
Korotkov  et  al.  [7], 

2.2.  Electron  Trap 

Nakazato  and  Ahmed  [8]  proposed  a  dynamic 
memory  cell  pushed  to  its  extreme  limit.  A  small 
number  of  electrons  is  stored  on  a  single  quantum 
dot.  The  central  island  of  a  single-electron 
transistor  which  is  extremely  charge  sensitive, 
may  be  used  as  sensor  for  the  logic  states.  However 
exactly  this  charge  sensitivity  makes  these  devices 
prone  to  random  background  charge.  The  more 
tunnel  junctions  are  used  the  less  likely  it  is  that 
electrons  are  co-tunneling  to  ground.  That  is  why 
this  memory  is  a  dynamic  memory,  since  the 
charges  can  not  be  trapped  infinitely.  Nevertheless, 
the  refresh  rate  can  easily  extend  to  the  second  or 
minute  range. 

2.3.  Ring  Memory 

Another  idea  which  is  a  generalization  of  the 
bistable  quantum  cell  for  cellular  automata  by 
Lent  et  al.  [9]  is  the  ring  memory  cell.  An  even 
number  n  of  tunnel  junctions  are  connected  to  a 
ring,  and  «/2  electrons  are  inserted  into  the  ring. 
These  electrons  will  repel  each  other  and  can  form 
two  stable  configurations.  Applying  voltage  pulses 
will  switch  the  state  of  the  ring  to  either  one  of  the 
stable  configurations. 


2.4.  Q  o  -Independent  Memory 

The  first  random  background  charge  or  go- 
independent  memory  was  proposed  by  Likharev 
and  Korotkov  [10],  The  basic  idea  is  the  following. 
Electrons  are  stored  on  an  island  or  floating  gate. 
A  single-electron  transistor  which  is  very  charge 
sensitive  on  its  gate,  is  used  to  sense  the  changes  of 
charge  on  the  floating  gate.  The  trick  to  achieve 
the  go-'ndependence  is  not  to  sense  any  absolute 
charges,  but  to  sense  the  current  oscillations  in  the 
single-electron  transistor,  which  are  caused  by 
changes  in  the  charge  on  the  gate.  In  other  words, 
the  charge  change  on  the  floating  gate  induces 
current  oscillations.  These  oscillations  occur  at  any 
background  charge.  Only  the  phase,  not  the 
amplitude  is  background  charge  dependent.  The 
cell  can  only  be  read  destructively  by  discharging 
the  floating  gate.  If  current  oscillations  are 
detected,  the  floating  gate  was  charged  with 
electrons.  If  no  oscillations  are  detected,  no  charge 
was  stored  on  the  floating  gate. 

2.5.  Discussion  of  Simulation  Results 

We  simulated  the  above  memory  cells  and 
especially  studied  their  operation  temperature, 
complexity  and  random  background  charge  de¬ 
pendence.  Due  to  lack  of  space  we  can  not  discuss 
the  details  of  our  simulation  results.  But  all  of  the 
above  designs  have  at  least  one  flaw.  Many  are 
random  background  charge  dependent  (flip-flop, 
electron  trap,  ring  memory).  Some  are  quite 
complex  and  show  a  relative  low  operation 
temperature  compared  to  other  designs  with  the 
same  tunnel  junction  parameters  (flip-flop,  ring 
memory). 

3.  THE  THREE  CHALLENGES 

Thus  we  can  name  three  challenges  that  have  to  be 
addressed: 

•  Room  temperature  operation, 

•  Industrial  mass  production, 
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•  Random  background  charge  independence  for 

reliable  operation. 

Coulomb  blockade  effects  are  only  observed  if 
the  charging  energy  associated  with  the  tunneling 
of  a  single  electron  is  bigger  than  the  thermal 
energy  kT,  with  k  the  Boltzmann-constant,  and  T 
the  absolute  temperature.  To  produce  devices 
which  operate  at  room  temperature  a  resolution 
limit  of  <  10  nm  is  necessary.  This  means,  that  for 
industrial  mass  production  only  naturally  formed 
tunnel  junctions  (poly-silicon,  granular  films)  are 
feasible  today.  It  is  not  possible  with  todays 
industrial  lithography  to  produce  a  single  tunnel 
junction  with  dimensions  smaller  10  nm.  The 
resolution  limit  of  optical  lithography  is  more 
than  a  magnitude  larger  than  10  nm.  Clearly,  the 
next  decade  will  see  new  high  resolution  lithogra¬ 
phy  methods  (X-ray,  e-beam,  near-field,  nano¬ 
imprint,  STM,  AFM,  . . .),  but  until  they  are  not 
available  for  industrial  mass  production  only 
naturally  formed  tunnel  junctions  in  granular  films 
can  reach  resolutions  in  the  nano-meter  regime 
and  thus  allow  room  temperature  operation. 


the  multi-island  transistor  which  is  controlled  by  a 
gate  electrode  which  stores  either  a  number  of 
electrons  or  a  number  of  holes.  Writing  a  ‘I’  or  ‘0’ 
is  done  by  applying  a  positive  or  negative  voltage 
pulse  at  the  word-line.  This  will  charge  the  gate 
electrode  which  has  a  similar  function  as  the 
floating  gate  in  a  flash  memory  cell.  Thus  in  our 
example  a  positive  write  pulse  on  the  word-line  will 
store  some  holes  on  the  gate  electrode  representing 
‘1’  and  a  negative  write  pulse  will  store  some 
electrons  on  the  gate  electrode  representing  ‘O’. 
Destructively  reading  the  memory  cell  is  done  by 
applying  for  example  a  negative  voltage  pulse  at 
the  word-line  and  sensing  current  oscillations  of 
/out.  If  7out  oscillates  then  the  cell  held  a  T.  If  no 
oscillations  are  picked  up  then  the  cell  held  a  ‘O’. 
Accordingly  the  contents  of  the  cell  has  to  be 
restored. 

The  '/'-cell  can  be  viewed  as  a  combination  of  the 
electron  trap  [11]  and  the  go-independent  memory 
cell  [10],  with  the  change  that  we  use  tunnel 
junction  arrays  for  both  elements,  the  trap  and  the 
read-out  transistor.  This  change  makes  the  T- 
memory  cell  much  easier  to  manufacture.  In  fact, 


4,  A  NEW  MEMORY  CELL 

Combining  todays  lithography  with  granular  film 
processes  one  can  manufacture  what  we  call  a  T- 
memory  cell.  Two  granular  film  batches  are 
arranged  in  a  T-shape  (see  Fig.  1).  It  is  a  combi¬ 
nation  of  a  multi-electron  trap  with  a  multi-island 
transistor  for  read-out.  The  cross-bar  of  the  T  is 


read-out  source 


V  write 


Ed  Ed 

HM 

1 — — |  rJ~| 

EEI 

IH 

rrb 

leej 

HU 

i  •  •  ~~i  r"  ~  ~i 

r~  -J 

Th 

r-bi 

1  ltetI 

HUH 

f=F  F==t 

r~i 

lb 

FF 

FIGURE  1  r-memory  cell. 


FIGURE  2  T-memory  cell  circuit. 
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since  the  granular  film  batches  can  have  dimen¬ 
sions  in  the  100  nm  regime,  state-of-the-art  optical 
lithography  is  sufficient  to  produce  such  memory 
cells. 

Yano  et  al.  [12]  manufactured  memory  cells 
where  the  storage  dots  and  read-out  circuit  are 
located  in  the  same  granular  film,  a  poly-silicon 
film.  The  disadvantage  of  this  design  is,  that  it  is 
much  more  difficult  or  even  impossible  to  tune 
process  parameters  in  order  to  change  the  char¬ 
acteristics  of  storing  and  reading  separately.  In  our 
design,  the  granular  films  for  the  storing  part  and 
the  read-out  part  could  be  produced  with  entirely 
different  process  parameters,  thus  optimizing  each 
performance  characteristics. 


time  [arbitrary  units] 


FIGURE  3  Writing  and  reading  the  T-memory  cell. 


We  simulated  the  cell  of  Figure  2  and  similar 
ones.  In  Figure  3  we  give  a  typical  write-read  cycle 
for  a  T  and  a  ‘O’.  We  simulated  the  same  write- 
read  cycle  for  different  random  background 
charges  ( Q0  =  0,  [0.2e,  -0.2e],  [OAe,  -OAe])  and 
randomly  changed  tunnel  resistors  and  capacitors. 
As  can  be  clearly  seen,  the  Coulomb  oscillations 
are  only  present  when  a  T  was  stored  in  the 
memory,  independent  of  background  charge. 


5.  CONCLUSION 

We  simulated  and  studied  various  single-electron 
memory  designs.  By  analyzing  proposed  memories 
(flip-flop,  electron  trap,  go-independent  memory, 
. . .)  we  came  up  with  a  new  memory  cell,  our  so 
called  T-memory  cell.  It  is  manufacturable  with 
todays  production  methods,  operates  at  room 
temperature,  shows  a  strong  background  charge 
resistance,  and  is  down-scalable  to  atomic  dimen¬ 
sions. 
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Relaxation  time  due  to  electron-longitudinal-acoustic  (LA)  phonon  interaction  is 
calculated  in  a  GaAs  quantum  dot  (QD)  with  N  electrons  (from  N=  1  to  4),  where 
electrons  in  a  narrow  quantum  well  are  confined  by  a  parabolic  confining  potential,  by 
using  the  exact  eigen  states  of  electrons.  Although  the  energy  levels  become  dense  with 
increasing  the  number  of  electrons,  the  modification  of  the  relaxation  time  is  found  to  be 
not  so  strong,  which  attributes  the  fact  that  many  electron  eigen  states  consist  of  only  a 
few  dominant  single  electron  states  which  limit  the  electron  relaxation.  By  comparing 
relaxation  process  via  intermediate  states  with  the  direct  process,  several  fastest 
processes  are  found  to  be  realized  by  relaxation  through  intermediate  states  between  the 
initial  state  and  the  ground  state.  The  effect  of  change  in  the  quantum  well  width  is  also 
discussed. 


Keywords:  Electron-phonon  interaction,  quantum  dot,  phonon  bottleneck,  exact  diagonalization 
method,  Coulomb  interaction 


1.  INTRODUCTION 

Low  dimensional  semiconductor  structures  such  as 
quantum  dots  (QDs)  have  attracted  much  atten¬ 
tion  because  of  their  unique  features  in  electronic 
properties  [1-3]  and  their  applications  to  future 
electron  devices  [4,5].  Although  their  quantized 
atomic  like  energy  levels  are  expected  to  have 
advantages  for  optical  devices,  energy  relaxation 
via  electron-optical-phonon  interaction  is  dimin¬ 
ished  in  QDs  for  energy  level  separation  of  order  1 
meV  because  the  mismatch  between  the  energy 


level  separation  and  the  optical-phonon  energy  of, 
for  example,  36  meV  in  the  bulk  GaAs  [6-10], 
which  is  often  referred  as  “phonon  bottleneck”. 
Several  theoretical  and  experimental  approaches 
have  been  devoted  to  the  study  of  the  phonon 
bottleneck  and  pointed  out  a  possibility  to  avoid  it 
by  utilizing  Auger  processes  [11],  multiphonon 
processes  [12]  or  the  role  of  excited  energy  levels  of 
the  exciton  [13]. 

In  the  process  of  light  emission,  electrons  and 
holes  are  first  created  in  higher  energy  states  and 
relax  down  to  the  ground  state  in  cascade  via 
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phonon  emissions  and  finally  recombine  to  emit 
light.  Energy  relaxations  from  higher  energy  levels 
to  sublevels  near  the  ground  state  are  expected  to 
be  fast  because  optical-phonon  interaction  plays  a 
dominant  role  in  the  electron  relaxation.  At  the 
final  stage  of  electron  relaxation  toward  the 
ground  state,  electron-longitudinal-acoustic  (LA) 
phonon  interaction  is  expect  to  be  important 
because  of  the  small  energy  separation  compared 
to  the  optical-phonon  energy.  In  a  QD  containing 
a  few  electrons  with  dot  size  comparable  to  the 
effective  Bohr  radius,  many  electron  eigen  states 
are  considerably  modified  by  the  Coulombic 
interaction  between  the  electrons.  In  order  to 
investigate  electron-phonon  interaction  in  such  a 
QD,  we  therefore  take  into  account  the  exact  N- 
electron  eigen  states  (for  N  up  to  4)  by  numerically 
diagonalizing  the  complete  N  particle  Hamiltonian 
[14-16]  including  the  Coulombic  interaction 
between  electrons  and  then  calculate  electron- LA 
phonon  relaxation  rates  toward  the  ground  state 
using  the  N  particle  eigen  states.  Since  there  are 
many  intermediate  states  between  the  initial  state 
and  the  ground  state,  we  calculate  relaxation  time 
via  intermediate  states  in  order  to  find  the  fastest 
process. 


2.  EIGEN  STATES  OF  A  MODEL  QD 


=  —  2^- +  -m*  +  y])  +  V(zi)  (2) 

where  rn  is  the  effective  mass  of  an  electron. 
Along  the  z  direction,  we  assume  an  infinite  square 
well  potential  of  the  width  W  and  consider  only 
the  ground  state  associated  with  the  quantized  z 
motion,  because  electrons  are  frozen  out  into  the 
lowest  subband  in  experimentally  realized  dots. 

In  order  to  solve  the  N  particle  Hamiltonian,  we 
use  N  particle  Slater  determinants  as  the  basis  sets 
which  can  be  composed  by  the  solutions  of  the 
single  particle  Hamiltonian  Hot-  The  many  particle 
eigen  functions  are  then  obtained  by  diagonalizing 
the  many  particle  Hamiltonian.  Figure  1  shows  the 
lowest  40  eigen  energies  of  a  QD  with  single,  two 
and  three  electrons.  For  a  single  electron  QD, 
equally  separated  energy  levels  are  seen,  which 
reflects  the  single-electron  confining  energy 
tiujQ  =  1  meV.  When  more  electrons  are  added 
into  the  QD,  total  ground  state  energy  of  the 
system  moves  up  and  becomes  greater  than  the 
simple  sum  of  the  single-electron  energy.  More¬ 
over  the  energy  level  structure  becomes  more  dense 
with  increasing  the  number  of  electrons.  From  this 
analysis  the  Coulomb  interaction  is  expected  to 
play  an  very  important  role  in  determining  the 
energy  spectra  in  a  few  electron  system  in  a  QD 
[1,16]. 


We  consider  a  QD  fabricated  on  a  AlGaAs/GaAs/ 
AlGaAs  quantum  well  structure  as  a  model 
system.  The  confining  potential  of  a  circular  QD 
in  x-y  plane  parallel  to  the  hetero-interface  is 
modeled  by  an  isotropic  and  parabolic  potential 
o(x2  +  y2)  with  Hloq  being  the  single  electron 
confining  energy.  Throughout  this  paper  we 
assume  a  typical  value  of  huj o  —  1  meV  for  the 
parabolic  potential.  The  Hamiltonian  of  N  elec¬ 
trons  including  the  Coulomb  interaction  between 
the  electrons  can  be  written  as 


n  -  j]  Ho, + 
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FIGURE  1  Total  eigen  energies  in  a  QD  which  contains 
single,  two  and  three  electrons.  In  the  model  used  in  the  present 
study,  energy  level  separation  for  single  electron  QD  is 
Hcjq  =  1  meV.  For  a  QD  with  two  and  three  electrons,  energy 
level  spectra  become  to  form  more  complex  and  dense 
structures  due  to  the  Coulombic  interaction. 
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3  ELECTRON-LA  PHONON  INTERACTION 

Since  we  are  interested  in  energy  relaxation  of 
electrons  in  QDs  at  low  temperatures,  we  take  into 
account  only  the  phonon  emission  process  and 
thus  assume  the  lattice  temperature  T~  OK  for 
simplicity.  Since  the  maximum  energy  separation 
in  our  model  system  is  the  order  of  1  meV, 
electron-optical-phonon  relaxation  can  be  ignored 
and  the  electron-LA  phonon  interaction  domi¬ 
nates  the  relaxation  processes.  The  electron-LA 
phonon  scattering  rate  Wu>  derived  from  the  first- 
order  perturbation  theory  using  the  Fermi’s  gold¬ 
en  rule  is  described  by  the  following  relation 

Ww  =  y  E 1  (i'\He.ph\i)\2(nq  +  l)8(Ei.-Ei  +  huq) 
<1 

(3) 

where  nq  is  the  phonon  occupation  number  which 
is  equal  to  0  in  the  present  calculation.  The 
electron-LA  phonon  interaction  Hamiltonian 
He-ph  is  given  by  the  following  equation 

He-ph  =  E  E  vAq)C\,Ci{aq  +  a.q)  (4) 

ii'  q 

v “  -  <5> 

where  C\{C/)  is  an  operator  to  create  (annihilate) 
an  electron  of  state,  i  and  a J  ( aq )  is  an  operator  for 
creation  (annihilation)  of  a  phonon  with  momen¬ 
tum  q.  In  the  present  calculation  we  assume  that 
the  electrons  interact  with  bulk  LA  phonons  and 
thus  the  deformation  potential  for  electron-LA 
phonon  is  given  by  the  deformation  potential  of 
the  bulk  GaAs,  D  =  7  eV. 

4.  RESULTS  AND  DISCUSSION 

Figure  2  shows  the  relaxation  times  for  transition 
from  the  excited  states  to  the  ground  state  in  a  QD 
with  fiuo  =  1  meV  and  W=  50  A.  In  the  relaxation 
processes  we  have  to  take  into  account  scattering 
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FIGURE  2  Relaxation  time  as  a  function  of  the  energy 
difference  between  the  initial  state  and  the  ground  state  for  a 
two-electron  QD  (a).  Large  open  circles  and  small  solid  circles 
denote  the  direct  relaxation  and  the  fastest  relaxation, 
respectively.  A  schematic  diagram  for  the  direct  and  the  fastest 
relaxations  from  the  initial  state  (2,  0;  4)  is  also  plotted  (b). 


from  the  excited  state  to  the  ground  state  through 
various  intermediate  states  in  addition  to  the  direct 
transition.  In  Figure  2(a)  we  show  the  calculated 
relaxation  time  toward  the  ground  state  in  a  two- 
electron  QD  where  open  circles  denote  direct 
processes  and  the  fastest  relaxations  are  plotted 
by  the  solid  circles.  The  indices  ( M,S;J\f )  shown 
in  Figure  2  are  quantum  numbers  of  eigen  states, 
indicating  Af-th  state  of  iV-electrons  system  with  a 
total  angular  momentum  HM  and  a  total  spin 
ft  S/2.  The  connected  pairs  of  an  open  circle  and  a 
solid  circle  indicate  the  relaxation  from  the  same 
initial  states  and  mean  the  existence  of  faster 
indirect  process,  whereas  the  open  circles  with  the 
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solid  circles  indicate  that  the  fastest  process  is  the 
direct  process  and  the  indirect  process  is  very  slow 
because  of  the  symmetry  of  the  wave  functions. 
For  electrons  with  =  (2,  0;  4),  the 

fastest  indirect  transition  arises  from  the  scattering 
process  via  the  state  (2,0;0)  as  shown  by  the 
arrow.  A  schematic  diagram  of  the  relaxations  for 
the  (2, 0;  4)  state  is  illustrated  in  Figure  2(b),  where 
the  energy  separations  from  the  ground  state 
and  the  relaxation  times  are  given  for  the  corres¬ 
ponding  eigen  states  in  Figure  2(a).  The  relaxation 
time  for  the  direct  process  is  about  6.38  ns,  whe¬ 
reas  the  relaxation  time  for  the  indirect  process  is 
given  by  1.27  ns  which  is  given  by  the  sum  of  0.71 
ns  for  the  scattering  from  (2,0;  4)  to  (2, 0;0)  and 
0.56  ns  for  the  scattering  from  (2,0;0)  to  the 
ground  state  (0, 0;  0).  In  Figure  3  we  plotted  relaxa¬ 
tion  times  for  A  =3  and  4  electrons  in  a  QD  with 

0 

huj o  —  1  meV  and  W=  50  A,  where  the  electrons  at 
higher  energy  states  have  longer  relaxation  time  due 
to  the  difficulty  in  satisfying  the  energy  and  momen¬ 
tum  conservation.  Although  energy  levels  become 
dense  with  increasing  the  number  of  electrons, 
modification  of  relaxation  times  are  not  so  strong 
because  many  electron  eigen  states  consist  of  only  a 
few  dominant  single  electron  states  which  limit  the 
electron  relaxation  for  many  electron  states. 

In  Figure  4  we  compare  the  fastest  relaxation 
times  in  two  QDs  with  different  well  widths  W=  50 
A  (solid  circles)  and  100  A  (open  triangles),  where 
two  cases  are  considered,  one  for  2  electrons  (a) 


FIGURE  3  Relaxation  time  as  a  function  of  the  energy 
difference  between  the  initial  state  and  the  ground  state  for 
three  (a)  and  four  (b)  electrons’  QDs.  Large  open  circles  and 
small  solid  circles  denote  the  direct  relaxation  and  the  fastest 
relaxation,  respectively. 

and  the  other  for  3  electrons  (b)  in  the  QDs.  The 
change  in  the  well  width  W  results  in  the  change  in 
the  confining  energy  in  z  direction  and  in  no 
change  in  the  eigen  energy  in  x-y  direction.  The 
change  in  the  confinement  in  the  z  direction 
changes  the  maximum  phonon  wave  vector  #max 
~  2n/W  which  gives  the  maximum  phonon  energy 
hcsqmixx  [8]  where  cs  is  the  sound  velocity.  Maxi¬ 
mum  phonon  energy  is  4.3  meV  for  W—50A 


FIGURE  4  Electron  relaxation  rate  in  a  QD  containing  the 
number  of  electrons  N~  2(a)  and  3(b)  with  different  well  widths 
of  W =  50  A  (solid  circles)  and  100  A  (open  triangles). 
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FIGURE  4  (Continued). 


and  is  2.2  meV  for  W—  100  A.  It  is  seen  in  Figure  4 
that  the  relaxation  time  for  a  QD  with  W—  100  A 
begins  to  increase  at  the  energy  separation  of  2 
meV.  From  the  present  analysis  it  may  be 
concluded  that  the  change  of  the  confining  energy 
in  the  z  direction  results  in  a  change  in  the  electron 
relaxation  process  even  in  a  QD  with  several 
electrons. 


5.  CONCLUSION 

Electron  relaxations  toward  the  ground  state  via 
LA-phonon  scatterings  in  a  QD  with  interacting  N 
electrons  (from  N  =  1  up  to  4)  are  calculated  using 
N  electrons’  eigen  states  obtained  by  the  exact 
diagonalization  method.  We  find  that  energy  levels 
become  dense  with  increasing  the  number  of 
electrons  and  that  the  modification  of  relaxation 
times  is  not  so  strong  because  many  electron  eigen 
states  consist  of  only  a  few  dominant  single 
electron  states  which  limit  the  electron  relaxation. 
Several  fastest  processes  are  realized  by  relaxation 
through  intermediate  states  between  the  initial 
state  and  the  ground  state. 

We  find  that  the  confinement  in  the  z  direction 
limits  the  maximum  phonon  energy  and  relaxation 


time  becomes  longer  when  an  energy  separation 
between  the  initial  state  and  the  final  state  exceeds 
maximum  phonon  energy. 
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Self-Consistent  Calculations  of  the  Ground  State 
and  the  Capacitance  of  a  3D  Si/Si02  Quantum  Dot 
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We  perform  self-consistent  electronic  structure  calculations  in  the  framework  of 
inhomogeneously  and  anisotropically  scaled  local  density  functional  theory  of  a  fully 
3D  modeled  Si/Si02  quantum  dot.  Electrons  are  laterally  confined  in  the  semiconduc¬ 
tor/oxide  heterojunction  by  a  metallic  gate  atop  of  the  device.  Total  charge  densities, 
total  free  energies,  chemical  potentials  for  different  numbers  of  electrons  in  the  dot,  and 
the  differential  capacitances  for  various  dot  sizes  are  calculated.  We  observe  shell  filling 
effects  in  the  differential  capacitance.  The  mag/c-numbers  are  governed  by  the  six  valley 
bandstructure  of  silicon,  which  leads  to  four  fold  degenerated  single  particle  levels  in  the 
dot. 


Keywords:  Si/Si02  quantum  dots,  capacitance,  self-consistent  calculations,  shell-filling  effects 


1.  INTRODUCTION 

With  VLSI/ULSI  production  technology  rapidly 
approaching  the  100  nm  range,  simulation  of 
quantum  effects  plays  an  increasingly  important 
role  in  nowadays  device  simulation  efforts.  For 
most  of  the  state  of  the  art  device  simulation  tools 
available  today,  work  is  already  under  way  to 
include  the  effects  of  one  dimensional  quantum 
confinement  in  silicon  MOSFET  channels  [1]. 
Typically,  the  Schrodinger-Poisson  equation  is 
solved  and  corrections  to  the  charge  density, 
which  enters  the  semiclassical  device  equations 


are  given.  However,  with  device  features  below  the 
100  nm  range  in  silicon  arid  zero  dimensional 
confinement  in  quantum  dots  dominating  the 
device  operation,  this  concept  is  bound  to  fail  to 
give  accurate  descriptions. 

Throughout  the  last  decade  there  has  been 
intense  research  focusing  on  so  called  single 
electron  devices.  These  devices  utilize  effects  as 
Coulomb  blockade  and  single  electron  charging, 
which  arise  with  zero  dimensional  confinement. 
Various  concepts  are  pursued  which  encompass 
for  instance  single  electron  transistors  (SET)  [2]  or 
two  state  switching  cells  in  an  adiabatic  switching 
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paradigm  [3].  However,  all  these  concepts  suffer 
from  severe  limitations.  The  demands  on  the 
purity  of  materials  to  avoid  random  offset  charges 
will  be  difficult  to  meet.  Phase  coherent  switching 
as  in  the  Lent/Porod  cells  [3]  requires  virtually 
defect-free  materials  over  a  considerably  large 
spatial  extent.  Other  limitations  arise  from  the 
very  device  principles.  SETs  need  to  have  capaci¬ 
tances  in  the  aF  range  in  order  to  be  operated  at 
room  temperature.  Therefore,  the  switching  power 
to  achieve  reasonable  response  times  will  be 
unsuitably  large  for  large  scale  integration  [4]. 

A  different  concept  was  put  forward  by  IBM 
researchers  J.  J.  Welser  and  S.  Tiwari  [5].  A 
quantum  dot  is  embedded  in  the  gate  oxide  of  a 
field  effect  transistor  (FET).  The  dot  is  charged 
with  a  small  but  finite  number  of  electrons  and 
changes  the  threshold  voltage  of  the  FET  accord¬ 
ing  to  the  electron  number.  The  device  was  already 
shown  to  have  stable  room  temperature  operation 
by  various  groups  [6],  and  hence,  is  a  very  promi¬ 
sing  candidate  for  further  silicon  based  memory 
devices. 

One  of  the  key  requirements  for  device  simula¬ 
tion  arising  from  all  these  concepts  is  the  ability  to 
model  zero  dimensional  confinement.  Here,  we 
present  a  concept  and  first  results  of  self-consistent 
calculations  of  quantum  dots  in  Si/Si02  hetero¬ 
junctions.  The  model  dot  is  a  somewhat  artificial 
device,  however,  it  already  incorporates  features 
that  will  arise  with  more  realistic  structures. 


2.  MODEL  QUANTUM  DOT 

Our  model  consists  of  a  90  nm  thick  undoped  Si 
layer  and  a  10  nm  thick  Si02  layer.  The  lateral 
dimensions  of  the  supercell  are  50  nm  x  50  nm 
with  a  metallic  gate  on  the  bottom  covering  the 
whole  bottom  side.  Atop  of  the  structure  we 
modeled  another  metallic  square  gate,  the  size  of 
which  was  varied  between  10  nm  x  10  nm  and  20 
nm  x  20  nm.  The  gate  voltage  is  applied  between 
the  top  and  the  bottom  gate.  The  size  of  the  top 


gate  defines  the  lateral  size  of  the  quantum  dot  in 
the  Si/Si02  heterojunction. 


3.  NUMERICAL  METHOD 

We  used  a  nonuniform  tensor  product  mesh  with 
Dirichlet  boundary  conditions  at  the  gates  and 
Neumann  boundary  conditions  elsewhere.  The 
operators  are  discretized  using  a  finite  difference 
approach.  We  solve  the  coupled  Kohn-Sham 
(Schrodinger)-Poisson  equation  system  self-con- 
sistently  including  exchange  and  correlation  in  the 
local  density  approximation.  Weak  coupling  of  the 
dot  to  the  reservoir  (leads)  is  assumed.  This 
constraint  imposes  vanishing  wavefunctions  at 
the  boundaries  of  the  supercell.  The  six  valley 
bandstructure  of  silicon  is  taken  into  account  via 
the  kinetic  operator 

T=\^2wijPiPh  (3-1) 

iJ 

where  pj  =  —iH(d/dxj).  Wy  is  the  reciprocal  effective 
mass  tensor.  In  terms  of  the  transformation  matrix 
ay  from  the  principal  axes  of  a  constant  energy 
ellipsoid  we  write 

Pi  =  ^aikP'k  (3-2) 

k 

WU  =  Y1  aikajkw'kk  =  Wji  (3.3) 

k 

where  wkk  =  1  /m'k  are  the  principal  reciprocal 
effective  masses  of  the  semiconductor. 

Total  charge  densities,  total  free  energies  (F), 
chemical  potentials  (p)  for  different  numbers  of 
electrons  in  the  dot,  and  the  differential  capacit¬ 
ances  (Cd) 

F(N)  =  kT  In  ^ exp [~Fa/kT]SNa,  N  (3.4) 

Oi 

V(N)  =  F(N)-F(N-  1) 


(3.5) 
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C<t(N)  = 


V(N+  1)-m(A0 


(3.6) 


are  calculated.  Using  these  results,  we  are  able  to 
determine  the  number  of  electrons  in  the  dot 


N  =  - 


^ATQexp[-(Fa  -  fj,Na)/kT] 

a _ _ _ 

^exp[-(F„  -  fiNa)/kT] 


(3.7) 


as  the  thermodynamical  average  over  all  config¬ 
urations  of  occupation  numbers  a  =  {«;}.  Here,  fj, 
denotes  the  chemical  potential  in  the  reservoir 
(leads). 


Number  of  electrons 


FIGURE  1  Si/Si02  quantum  dot:  differential  capacitance  vs 
number  of  electrons  for  a  gate  size  of  10  nm  x  10  nm.  The  gate 
voltage  is  2  V.  The  inset  shows  a  plot  of  the  number  of  electrons 
vs  the  Fermi  level  in  the  leads. 


4.  RESULTS 

We  assumed  the  gate  to  be  perpendicular  to  the 
(100)  substrate  orientation.  Applying  the  trans¬ 
formations  of  equations  Eqs.  (3.1),  (3.2)  and  (3.3) 
we  obtain  three  different  Hamiltonian  operators, 
which  differ  in  their  kinetic  parts  by  different 
effective  masses  perpendicular  to  the  interface. 
However,  they  contain  the  same  effective  potential. 
The  single  particle  levels  are  four-fold  degenerate. 
This  is  in  contrast  to  the  spherical  total  energy 
surface  of  GaAs  where  only  spin  degeneracy  is 
present.  In  silicon  we  have  an  additional  (- k,  +  k ) 
degeneracy,  since  two  equivalent  valleys  occur  in 
the  -k  and  the  +  k  directions.  All  calculations  are 
performed  for  a  low  temperature  of  4.2  K,  neglec¬ 
ting  intervalley  splitting  effects. 

The  self-consistent  single  particle  potential  has 
shown  to  be  quasi  parabolic.  Consequently,  the 
single  particle  eigenvalue  spectrum  exhibits  a  shell 
structure  similar  to  the  harmonic  oscillator  spec¬ 
trum. 

Following  the  arguments  of  Macucci  et  al.  [7] 
for  GaAs  systems,  we  discuss  the  differential 
capacitance  curves  displayed  in  Figures  1  and  2. 
When  adding  electrons  to  the  dot,  the  free  energy 
increases  almost  linearly  when  adding  to  the  same 
shell,  however,  the  slope  increases  from  shell  to 
shell.  Therefore,  a  minimum  in  the  capacitance 


FIGURE  2  Si/Si02  quantum  dot:  differential  capacitance  vs 
number  of  electrons  for  a  gate  size  of  20  nm  x  20  nm.  The  gate 
voltage  is  2  V.  The  inset  shows  a  plot  of  the  number  of  electrons 
vs  the  Fermi  level  in  the  leads. 

occurs,  when  a  shell  is  filled.  In  contrast  to  GaAs, 
the  shell-filling  in  silicon  is  governed  by  the  four¬ 
fold  degenerate  single  particle  levels.  Since  the 
confinement  in  the  z-direction,  perpendicular  to 
the  Si/Si02  interface,  is  stronger  than  the  in-plane 
confinement  due  to  the  top  gate,  only  eigenstates 
from  the  Hamiltonian  with  the  longitudinal 
effective  mass  in  z-direction  contribute  to  the  total 
charge  at  low  electron  filling.  We  observe  a 
minimum  of  the  capacitance  at  the  4th  electron, 
which  is  due  to  the  complete  filling  of  the  first  four- 


234 


A.  SCHOLZE  et  al. 


30 

> 

|  29 

>,  __ 

- 1 - 1 - 1 - 1 - r 

- 1 - r 

m 

- 

O)  28 

tfr-BK dOft 

- 

\ 

c 

<5  27 

\- 

P'-V; 

- 

CD 

CM 

uo|i!P 

■O 

<  25 

- 1 - 1 - 1 _ L_ .  1 

- 1 _ L 

“ 

I - 1 - 1 _ i _ i  '  i  i  I 

0  2  4  6  8 

Number  of  electrons 


FIGURE  3  Si/Si02  quantum  dot:  addition  energy  vs  number 
of  electrons  for  a  gate  size  of  20  nm  x20  nm.  The  inset  shows 
the  principal  device  structure  and  a  plot  of  the  charge  density 
for  12  electrons  (cut  parallel  to  the  interface). 

fold  degenerate  eigenstate.  The  minimum  is  more 
pronounced  with  the  smaller  gate  (Fig.  1)  which 
leads  to  stronger  confinement  in  the  x-  and  y- 
directions.  The  larger  gate,  and  therefore,  weaker 
confinement  (Fig.  2)  leads  to  an  almost  linear 
increase  of  the  capacitance.  Shell-filling  effects  are 
suppressed,  and  the  dot  electrons  behave  more  like 
a  classical  circular  charge  distribution,  that  in¬ 
creases  its  capacitance  with  increasing  radius  as 
C=Seo£rr.  The  classical  value  for  the  capacitance 
of  this  dot  is  8  aF  (r=10  nm,  er=11.8),  which 
means  a  slight  overestimation  of  our  value  of  6  aF 
(A  =4).  However,  the  effective  size  of  the  dot 
underneath  the  gate  is  somewhat  smaller  than  the 
actual  gate  size. 

The  simple  picture  of  harmonic  oscillator-like 
shells  is  not  valid  when  higher  electron  numbers 
occur.  States  from  the  other  Hamiltonians  with 
different  symmetries  of  the  wavefunctions  then 
constitute  almost  arbitrary  shells.  This  is  especially 
evident  for  a  stronger  confinement  in  the  x-  and  y- 
direction  due  to  a  smaller  gates.  Some  shell-filling 
at  the  13th  electron  is  indicated  (Fig.  1). 

5.  CONCLUSIONS 

We  showed  that  shell-filling  effects  occur  in  silicon 
quantum  dots.  Due  to  the  larger  effective  masses, 


however,  these  effects  are  not  as  dominant  as  in 
GaAs  dots  of  the  same  size.  However,  since  room 
temperature  operation  of  quantum  dot  devices 
requires  a  considerable  level  separation  compared 
to  the  thermal  level  broadening,  overall  smaller 
device  features  are  needed  with  silicon  and  shell¬ 
filling  effects  will  dominate  again. 

Magic- numbers  are  governed  by  the  six  valley 
bandstructure  of  silicon.  Four-fold  degenerate 
single  particle  levels  are  occupied.  This  again  is 
in  contrast  to  GaAs,  where  only  spin  degeneracy  is 
present.  The  three  principal  directions  of  the 
constant  energy  surface  lead  to  a  system  of  three 
different  Schrodinger  equations,  which  have  to  be 
solved  self-consistently  with  respect  to  the  effective 
single  particle  potential.  The  problem  of  inter¬ 
valley  coupling  has  not  been  tackled  in  this  paper, 
however,  it  may  play  a  role.  Further  investigation 
in  this  problem  is  needed. 
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An  Interband  Tunnel  Oscillator:  Intrinsic  Bistability 
and  Hysteresis  of  Trapped  Hole  Charge 
in  a  Double-Barrier  Structure 

F.  A.  BUOT 

Naval  Research  Laboratory,  Washington ,  D.  C.  20375-5320 


We  introduced  a  novel  high-frequency  source  based  on  interband  tunneling.  A 
polarization-induced  oscillation  of  trapped-hole-charge  occurs  in  an  AlGaSb/InAs/ 
AlGaSb  resonant  tunneling  device.  Rate  equations  for  Zener  tunneling,  polarization, 
and  electron-hole  recombination  is  used  to  analyze  the  nonlinear  dynamics  of  this  device 
structure.  The  nonoscillatory  state  is  unstable  against  the  limit-cycle  operation.  The 
amplitude  of  trapped  hole  oscillation  increases  with  bias,  but  the  time-averaged  values 
can  be  approximated  by  a  step  function.  These  lead  to  the  hysteresis  of  the  averaged 
trapped  hole  charge  in  AlGaSb  barrier,  and  to  the  experimental  intrinsic  bistability  in 
AlGaSb/InAs/AlGaSb  resonant  tunneling  device.  Large-scale  time-dependent  simula¬ 
tion  of  quantum  transport  with  interband-tunneling  dynamics  is  needed  for  the  design 
optimization  of  this  novel  class  of  oscillator  useful  for  high-bandwidth  applications. 


Keywords:  Limit  cycle,  hysteresis,  trapped  holes,  intrinsic  bistability,  zener  effect,  resonant 
tunneling  devices,  self-oscillation 


1.  INTRODUCTION 

The  ‘hetero’  junction  has  become  the  basic 
building  block  of  most  of  the  advanced  high-speed 
devices  for  electronic,  microwave,  and  optoelec¬ 
tronic  applications  [1,2].  Moreover,  tunneling 
devices  exhibit  autonomous  oscillation,  similar  to 
Gunn  effect  devices  but  at  much  higher  frequencies 
in  nanometric  sizes.  For  conventional  resonant 
tunneling  devices  (RTD)  [3-6],  this  occurs  when 
the  device  is  operating  in  the  negative-differential- 
resistance  (NDR)  region,  just  after  the  resonant 
current  peak.  The  oscillation  addressed  in  this 


paper  occurs  before  the  resonant  current  peak, 
based  on  interband  tunneling  in  RTD  with 
staggered  band-gap  alignment. 

A  staggered  band-edge  alignment  can  be  rea¬ 
lized  by  using  InA/AlSb  hetero  junctions, 
Figure  la.  In  a  simple  implementation  of  a  novel 
interband  tunnel  high-frequency  source,  a  deeper 
quantum-well-for-holes  is  desirable  which  can 
support  a  localized  hole  state;  this  is  obtained  by 
using  In  As/AlGaSb  heterojunctions,  Figure  lb. 
Unless  otherwise  specified,  quantum  well  refers  to 
the  conduction  band  edge  and  conduction-band 
electrons. 
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FIGURE  1  (a)  Energy  band  edge  (EBE)  alignment  of  RTD 
using  InAs/AlSb  heterojunction,  (b)  Energy  band  edge  align¬ 
ment  of  RTD  using  InAs/AlGaSb  heterojunction.  Approxi¬ 
mated  band-edge  offsets  are  indicated  in  electron  volts. 

The  new  mechanism  of  modulating  the  resonant 
energy  level  in  the  quantum  well  with  respect  to 
the  energy  distribution  of  the  electrons  from  the 
emitter  can  simply  be  described  through  the 
oscillatory  build-up  and  decay  of  the  polarization 
pairing  between  electrons  in  the  quantum  well  and 
trapped  holes  in  the  barrier.  This  modulation  is 
controlled  by  trapped  holes  (similar  to  base 
charges  of  a  bipolar  transistor).  Thus,  for  the  first 
time  we  realize  an  autonomous  control  of  a 
significant  current  by  an  interband  process. 

We  refer  to  the  polarization  pair  as  a  duon ,  since 
this  Coulomb-correlated  e-h  pair  only  moves  in 


the  transverse  direction,  in  contrast  to  exciton. 
This  is  in  analogy  to  the  use  of  trion  in  referring  to 
a  correlated  exciton  and  electron  in  adjacent 
quantum-well  heterostructure  [11].  In  Sec.  3,  we 
introduce  the  physics  of  the  duon  dynamics.  The 
limit  cycle  solution  leads  to  an  oscillatory  voltage 
drop  between  the  quantum  well  and  the  barrier. 
Since  common  experimental  techniques  are  incap¬ 
able  of  investigating  these  oscillations  [3,  4,  8],  the 
current-voltage  {I-  V)  characteristic  is  also  calcu¬ 
lated  in  Sec.  4.  The  results  agree  with  the 
experiment.  In  Sec.  5,  we  draw  some  conclusions, 
as  well  as  give  a  summary  of  this  paper. 

2.  HIGH-FREQUENCY  OPERATION 

Under  bias,  when  the  localized  electrons  in  the 
AIGaSb  barrier  of  Figure  lb  see  the  available 
states  these  electrons  tunnel  to  the  drain  by  Zener 
transition,  leaving  behind  holes  in  a  discrete 
‘longitudinal’  energy  level,  en.  This  is  initiated 
when  a  matching  of  en  with  available  conduction- 
band  states  in  the  drain  first  occurs,  at  kf2  «  k £2 
in  Figure  2.  Figure  2  serves  to  define  several 
quantities  used  in  the  calculations  of  Sec.  4.  The 
drain  acts  as  a  sink  due  to  unoccupied  states  above 
kp  that  could  satisfy  the  conservation  of  transverse 
crystal  momentum  associated  with  en. 

As  hole  charging  occurs,  Figure  3  (1),  the 
polarization  Figure  3  (2)  creates  a  high-field 
domain,  at  the  expense  of  the  potential  drop 
between  the  contact  and  the  barrier.  When  the 
situation  shown  in  Figure  3  (3)  is  reached,  the 
onset  of  other  mechanisms  for  hole  discharging 
may  also  occur,  namely,  thermal  activation  of  the 
valence  electrons  in  the  continuum  to  recombine 
with  localized  holes,  or  loss  of  any  bound  hole 
states  in  the  barrier.  The  ‘hole  leakage’  will  restore 
the  large  voltage  drop  between  the  barrier  edge 
and  the  right  contact.  The  situation  shown  in 
Figure  3  (1)  is  revisited,  after  which  the  process 
repeats.  Therefore,  oscillations  of  the  hole  char¬ 
ging  of  the  AIGaSb  barrier  can  occur  at  high 
frequency  by  virtue  of  the  nanometric  features  of 
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FIGURE  2  Schematic  averaged  EBE  profile  showing  the 
various  quantities  used  in  the  calculations  of  /—  V  plot. 
The  shaded  regions  in  the  lower  right-hand  comer  indicate 
the  occupied  transverse  and  longitudinal  momentum  states  in 
the  drain. 


the  device.  We  estimate  the  charging  time,  which  is 
the  dominant  time  scale  of  the  problem,  to  be 
about  200-1000  femtoseconds  for  a  heavy-hole 
state  in  AlGaSb  with  indirect-gap  interband 
tunneling  through  Keldysh  effect. 

The  above  dynamical  process  limits  the  amount 
of  hole  charge  that  can  be  trapped  in  the  barrier  as 
a  function  of  bias.  The  interband  recombination 
process  can  not  compete  with  the  conduction-band 
electron  tunneling  process  through  the  barrier  [3]. 
The  criteria  for  either  detailed  balance  or  oscilla¬ 
tory  behavior  are  governed  by  the  two  character¬ 
istic  times,  namely,  the  polarization-charge  build¬ 
up  time,  tb,  and  the  charge-leakage  time,  tl.  If 
tb>tl,  then  the  charging  process  will  be  lagging 
behind  the  discharging  process  and  oscillations 


FIGURE  3  (1)  e-h  generation  by  Zener  tunneling.  (2)  duon 
generation  is  through  an  autocatalytic  process.  (3)  three 
mechanisms  for  hole  discharging  are  mentioned  in  the  text. 


will  result.  This  holds  true  in  conventional  RTD 
and  single-electron  devices  [10].  It  is  here  estimated 
that  ts>tl ,  by  virtue  of  several  possible  fast  hole- 
discharge  channels  mentioned  above.  The  ‘book¬ 
eeping  equations’  (‘chemical  kinetics’  modeling)  of 
the  device  operation  follows. 


3.  POLARIZATION  PAIRING  DYNAMICS 

We  derive  here  the  coupled  rate  equations  for  the 
processes  in  Figure  3.  The  duon  formation  is  via 
the  polarization-induced  transport  of  conduction 
electrons  from  the  emitter  to  the  quantum  well, 
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coupled  with  a  succeeding  e— h  generation  by 
Zener  tunneling.  In  process  (2),  the  Zener-tunneled 
electron  flows  to  the  metallic  contact  of  the  drain, 
and  this  is  substituted  by  the  tunneled  conduction 
electron  from  the  emitter  to  form  a  duon.  We 
observe  here  an  ‘autocatalytic’  or  positive-feed¬ 
back  aspect  of  the  duon  formation. 

Let  G  be  the  maximum  rate  of  e-h  generation 
by  Zener  tunneling.  Note  that  G  is  a  direct 
measure  of  the  applied  bias.  The  duon  generation 
rate  with  ‘three  interacting  components’  can  be 
expressed  as  A Jf\P,  where  JfB  is  the  concentra¬ 
tion  of  ‘unpaired’  holes  which  is  equal  to  the 
concentration  of  ‘exiting’  electrons  produced  by 
Zener  tunneling,  P  is  the  concentration  of  duons , 
and  A  is  a  parameter  which  is  expected  to  acquire, 
in  appropriate  ranges,  a  nonlinear  dependence  on 
P  as  discussed  below.  We  can  now  write  the 
‘effective’  generation  rate  of  unpaired  trapped 
holes  in  the  barrier  as 

^jtrB  =  G-AjrlP.  (i) 

The  total  concentration  of  trapped  holes  in  the 
barrier,  QB ,  at  any  time  is  given  by  QB  =  JfB  +  p. 

For  large  P  the  transfer  of  conduction  electrons 
from  the  emitter  to  the  quantum  well  becomes 
more  efficient  since  it  is  approaching  the  resonance 
peak  and  A  increases.  For  very  small  P9  the 
concentration  of  resident  electrons  already  existing 
in  the  quantum  well  (refer  to  Fig.  4)  will  also 
render  the  polarization  pairing  to  be  much  more 
efficient,  and  hence  a  larger  A,  than  for  the 
intervening  ranges  between  small  P  and  large  P. 
This  acquired  nonlinearity  of  the  parameter  A  is 
important  in  establishing  a  limit  cycle  operation  of 
the  device,  since  it  limits  the  growth  of  the  solution 
from  the  unstable  focus.  Indeed,  we  shall  see  that 
the  ‘linear’  criterion  for  unstable  stationary 
operation  is  that  F(G)  >  4A  where  F  (G)  is  a 
constant  for  a  fixed  bias. 

The  decay  rate  for  P  is  expected  to  saturate  for 
very  large  P .  In  all  foreseeable  cases,  we  may 
express  the  decay  rate  of  the  duon  concentration  as 


FIGURE  4  More  efficient  generation  of  initial  polarization 
pairs  or  duons  is  due  to  the  initial  resident  excess  electron  in  the 
quantum  well  for  a  given  applied  bias. 


=  <xP/(l  +  (3P),  where  1//?  represents  the  sum  of 
available  states  in  the  valence  band  of  the  quantum 
well  region  and  the  states  participating  in  thermal 
recombination;  otherwise  it  represents  the  actual 
concentration  of  hole  states  in  the  barrier  in  the 
case  of  the  loss  of  bound  hole  state.  This  is  similar 
to  the  decay  law  ubiquitous  in  chemical  kinetics 
[12].  The  parameter  a  is  the  decay  rate  constant 
and  a  1/3  is  the  value  of  the  saturated  decay  rate  of 
duons.  Therefore,  we  can  now  write  the  duon 
generation  rate  as 


dt 


p=kjrip~- 


aP 

1  +/3P ' 


(2) 


As  seen  in  Eq.  (4)  below,  the  physical  situation 
corresponds  to  a//3  >  G.  Indeed,  we  can  estimate 
that  1/tb  »  G  «  G  and  1/tl  «  ai 3.  Therefore 
a/ 3  >  G  implies  that  tb>  tl. 


3.1.  Stability  Analysis 

The  stationary  solution  for  a  fixed  bias  (implying  a 
fixed  G)  to  Eqs.  (1)  and  (2)  is  given  by 


G  —  Kjf2BP  — 


aP 

1  +0P' 


(3) 
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The  total  stationary  trapped  hole  concentration, 
Qb ,  is  thus  given  by 


Qb  =  ^°b  +  P°  = 


a  —  0G 


a- f3G 


r 


(4) 


which  is  a  sum  of  an  increasing  and  a  decreasing 
function  of  bias.  The  more  accurate  average  value 
under  a  limit  cycle  oscillation  is  shown  in  Sec.  3.2 
to  be  approximately  independent  of  bias.  Since  QB 
and  P°  are  constants,  the  duon  production  rate  is 
via  the  transfer  of  conduction  electrons  from  the 
emitter  to  the  quantum  well  and  the  duon  decay 
rate  is  via  transfer  of  conduction  electrons  from 
the  quantum  well  to  the  drain.  Thus,  the  d.c. 
operation  no  longer  involves  interband  processes, 
as  schematically  shown  in  Figure  5. 

Complete  analysis  of  the  stability  of  the 
stationary  solution  as  well  as  the  full  derivation 
of  the  limit  cycle  solution  will  be  given  elsewhere. 
By  transforming  to  dimensionless  variables:  11  = 
PP,  S  =  PJTB,  A  =  A(1  /(3)2/a,  9  =  Glial p) 
<  1.0,  the  criterion  for  unstable  stationary  solu¬ 
tion  is  that  (1  -  ^)3>  4A.  Following  a  perturba¬ 
tion  technique  using  multiple  time  scales  [13, 14], 
we  have  obtained  to  second  order  in  the  smallness 
parameter,  e  =  ({A  -  A^/A^)^2,  where  Ac  = 
((1  -  ^)3/4)  and  A2  comes  from  the  expansion 
of  A  near  Ac  in  powers  of  e,  the  limit  cycle 
solution.  This  is  given  as 


FIGURE  5  At  steady  state  the  two  conduction-electron- 
mediated  duon  decay  and  generation  processes  are  balanced 
resulting  in  steady-state  current  across  the  double  barrier 
structure. 


(S)-(5M:M::) 

(5) 

The  two  column  vectors  (^°)  and  (^)  contain  the 
oscillating  factors  associated  with  first  and  second 
orders,  respectively.  We  note  that  (£■)  and  higher- 
order  terms  also  contain  higher-order  time-inde¬ 
pendent  terms  which  increases  with  bias.  There¬ 
fore,  the  average  value  of  Q  is  given  by 

7T°  \ 

1  +  higher-order  corrections, 

(6) 

where  the  leading  higher-order  corrections  comes 
from  these  time-independent  terms. 


3.2.  Dependence  with  Bias 

Based  on  a  finite  limit  of  the  amplitude  (which 
only  depends  on  the  slow  time  scale),  the  limit 
cycle  solution  is  found  to  occur  within  the  range  of 
values  of  the  parameter  A  where  the  ‘linear’ 
criterion  for  unstable  focus  (1  —  ^)3  >  4A  still 
holds.  This  is  analogous  to  the  numerically 
simulated  limit  cycle  of  AlGaAs/GaAs/AlGaAs 
double-barrier  heterostructure  operating  in  the 
NDR  region  [7].  For  values  of  A  near  the  critical 
point,  the  rate  of  change  with  respect  to  3?  is 
approximately  zero.  Indeed,  we  have  from  Eq.  (4), 

worn  i  (  i  y/2<0 

<&  (1-9)2  V4A(1  -9)J  ~  ’ 

(7) 

where  the  equality  is  obtained  at  A  =  Ac  = 
(1  -  ^)3/4.  Since  0  is  our  measure  of  the  applied 
voltage  applied,  we  conclude  from  Eq.  (7)  and  by 
taking  into  account  the  higher-order  correction 
terms  which  increases  with  ^  that  the  average  total 
hole  charge  trapped  in  the  barrier  is  approximately 
independent  of  bias. 
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Denoting  the  leading  time-dependent  part  of 
Qb(0  as  6QB(t),  the  total  trapped  hole  charge  in 
the  barrier  oscillates  with  amplitude  that  increases 
with  bias  and  is  given  by 


«e*(0  = 


4(1  A- Ac  |)1/2 
0 


1/2 


L(l-^)4|{-8-17^  +  36^2-ll^3}|J 
[cosf It  —  sin(0/  +  $)] 


(8) 


where  the  frequency  O  consists  of  a  function  of  ^ 
plus  higher-order  terms,  and  $  =  {^/  (1  -  ^)}^2 
>  0.  The  oscillation  amplitude  grows  in  response 
to  the  increasing  maximum  electric  field  in  the 
depletion  region  with  the  applied  bias,  since  the 
maximum  e-h  generation  rate,  37,  by  Zener 
tunneling  [15]  increases  with  bias. 


4.  INTRINSIC  BISTABILITY 
IN  InAs/AljGai-jSb  RTD 

The  time-averaged  hole  charge  in  the  barrier  is 
referred  to  as  <2(AlGaSb)  =  eQB,  where  e  is  the 
positive  unit  charge.  This  value  is  approximately 
independent  of  bias  in  Sec.  3,  after  an  abrupt 
increase  at  kf 2  ~  in  Figure  2.  The  self- 
consistency  of  the  potential  alone,  in  Figure  2, 
demands  that  the  polarization  and  hence  g(Al- 
GaSb)  increases  monotonically  with  bias.  We  shall 
see  that  the  simultaneous  solution  to  these  two 
requirements,  plus  the  continuity  condition,  leads 
to  a  ‘parallelepiped’  hysteresis  of  the  trapped  hole 
charge  in  the  barrier. 

We  only  need  three  field  parameters  to  include  a 
concave  EBE  profile  in  the  barrier  region.  The 
inflection  point  is  assumed  to  have  a  measure  zero 
as  far  as  the  integration  of  the  fields  to  obtain  the 
total  voltage  drop  across  the  device  is  concerned. 
For  nonzero  average  value  of  JfB,  which  is  the 
concentration  of  “unpaired”  trapped  holes,  we 
also  expect  a  nonzero  superposed  polarization 


between  the  barrier  and  spacer  layer  to  be  affecting 
the  potential  profile,  as  indicated  by  a  simple 
‘kink’  in  the  spacer  region  of  Figure  2.  We  estimate 
the  fourth  field  parameter  in  the  second  half  of  the 
barrier  as  proportional  to  ER,  as  the  figure 
suggests  with  proportionality  factor,  \iV)  <  10, 
and  still  maintain  the  physical  requirement  of 
concave  EBE  profile  in  this  region.  This  accounts 
for  nonzero  average  JfB. 

From  Figure  2,  the  trapped  hole  charge  in  the 
second  barrier  is  given  by  the  expression:  \Er  ~ 
EB  =  £?(AlGaSb)/£,  from  the  Poisson  equation. 
We  estimate  the  proportionality  factor  \  Is  close  to 
unity  and  positive.  From  the  requirement  of  faster 
voltage  drop  in  the  barrier  region  in  Figure  2,  we 
must  have  EB  more  negative  than  \Er,  thus  we 
obtain  Q(AlGaSb)  >  0  consistent  with  the  trapped 
hole  charge  in  the  second  barrier. 


4.1.  Hysteresis  of  Trapped  Hole 
Charge 

The  positive  applied  bias,  V,  in  Figure  2,  is  given 
by  the  following  expression,  eV  —  e\EL\{b  -f  w/2) 
+e\ER\(w/2)  +  Eg  +  {h2kf  /2m*)  -  Ehh ,  where  w 
and  b  are  the  width  of  the  quantum  well  and 
barrier,  respectively.  We  use  the  Poisson  equation 
to  eliminate  |is*|  in  terms  of  \EL\.  Since  all  fields  on 
the  average  have  negative  sign  for  positive  applied 
voltage,  we  may  also  write  Poisson  equation  as 
\Er\  -  \El\  =  \Qw\/e,  \Eb\  -  x  \Er\  =  <2(AlGa  Sb)/e. 
Therefore,  we  obtain  the  following  expression  for 
the  trapped  hole  charge, 


^(AlGaSb)  =  — 
ew 


(Eg  -  Ehh) 

L 

(eV~e\Et\^b  + 


h'O 


2  m* 


(9) 


The  trapped  hole  charge  is  an  increasing 
function  of  kf2  in  Eq.  (9),  since  the  ‘polarization’. 
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without  the  constraint  of  quantum  transport 
nonlinear  dynamics,  should  increase  monotoni- 
cally  with  bias. 

The  independence  with  bias  beyond  a  threshold 
value  of  the  trapped  holes  in  Sec.  3  is  expressed 
here  by  a  step  function 

g(AlGaSb)  =  QhS(kf 2  -kf ).  (10) 

The  simultaneous  solutions  of  Eqs.  (9)  and  (10) 
is  shown  graphically  in  Figure  6,  Eq.  (9)  for 
g(AlGaSb)  vs.  kf 2  is  approximated  by  positive 
sloping  lines.  Upon  applying  the  continuity  con¬ 
dition,  open  circles  and  solid  circles  are  solutions 
for  the  increasing  voltage  sweep  and  decreasing 
voltage  sweep,  respectively.  A  ‘parallelepiped’ 
hysteresis  of  trapped  hole  charge  as  a  function  of 
bias  is  clearly  indicated. 


and  \E'r\  —  \EB\  =  /3'|2(AlGaSb)/£,  where  a'  and 
/ 3 '  proportionality  constants.  Thus  we  may  write 
eV=e\EL\(b+w/2)+  e\E,R\(w+b)/2+e\ElB\(b/2+c). 
We  then  expressed  \E'R\  and  \EB\  in  terms  of  \QW\, 
2(AlGaSb),  and  We  also  use  the  relation: 
e\EL\  =  e(V  -  <t>c)/(2b  +  w),  where  e(j>c  =  eV- 
Ew  +  f^kf  /2m*  to  obtain  the  result 

\Qw\  =  f  - - Y 

a'((Z>  +  c)+fJ 


/3'Q(AlGaSb) 

2i{bjl  +  ~c) 


(11) 


where  we  have  £  =  (26  +  w  +  c)/(26  + w).  The 
quantum  transport  requirement  for  Qw  was  given 
by  Buot  and  Rajagopal  [9,10]  as 


4.2.  Hysteresis  in  the  I-  V  Characteristics 

This  is  obtained  by  describing  the  whole  length  of 
the  device  by  three  independent  fields,  namely,  EL , 
Er  and  EfB.  Note  that  the  field  \Er  used  before  is 
only  valid  in  the  right-half  of  the  barrier  region,  by 
virtue  of  nonzero  average  concentration  of  ‘un¬ 
paired’  trapped  holes,  jV'b,  as  indicated  in  Figure 
2.  The  field  EL  is  as  defined  before,  whereas  ER 
is  defined  by  the  relation:  ER(b/ 2  +  w/2)  = 
Er(w/2)  +  EB(b/2),  and  EB  is  the  constant  field 
approximation  for  the  rest  of  the  device  of 
dimension  [(b/2)  +  c\.  As  a  consequence,  we  also 
have  the  following  relation:  \ER\  -  \EL\  —  a'\Qw\/e 


which  is  zero  for  k ^  <  0,  1  /rd=  \/re+  \jrc ,  where 
1  jrc  is  the  effective  rate  of  decay  of  Qw  into 
unoccupied  collector  states  and  l/re  is  equal  to  the 
rate  of  supply  of  electrons  from  the  emitter  to  the 
quantum  well.  The  simultaneous  solution  of  Eqs. 
(11)  and  (12)  is  also  graphically  obtained  as  shown 
in  Figure  7,  In  Figure  7a,  Eq.  (11)  is  approximated 
by  parallel  sloping  lines.  The  values  for  Q(A1- 
GaSb)  are  solutions  obtained  from  Figure  6  which 
create  an  offset  in  the  sloping  lines  of  Eq.  (11), 
leading  to  higher  values  of  Qw  as  indicated  by  the 
dotted  arrows.  For  the  increasing  voltage  sweep, 
the  solutions  for  Qw  are  given  by  the  intersection 
points  S\ ,  52,  Sf , Sf ,  ,  Sf (low),  $6 (high),  S7  and 
S  8.  For  the  reverse  voltage  sweep,  the  correspond¬ 
ing  solution  points  are  S%,S7,S6,Sf, 

[high],  S2 [low],  and  back  to  S\. 

The  RTD  current  can  be  approximated  by  QJ 
rc.  The  resulting  I-  V  has  all  the  salient  features  of 


FIGURE  6  Graphical  solution  of  Eqs.  (9)  and  (10). 
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FIGURE  7  (a)  Graphical  solution  of  Eqs.  (11)  and  (12).  (b) 
The  solution  for  the  I-  V  characteristic  showing  ‘parallelepiped’ 
hysteresis  occurring  before  the  RTD  current  peak,  in  agreement 
with  the  experiment. 


the  experimental  results  [9],  shown  in  Figure  7b 
with  corresponding  solution  points  indicated.  This 
result  serves  as  indirect  experimental  evidence  on 
the  ability  of  this  device  to  function  as  a  high- 
frequency  source. 

5.  CONCLUSIONS 

The  nonlinear  dynamics  of  coupled  systems  of 
duons  and  ‘unpaired5  trapped-holes  in  RTD  with 
staggered  band-gap  alignment  provides  a  funda¬ 
mental  physical  basis  of  the  experimental  intrinsic 
bistability  of  AlGaSb/InAs/AlGaSb  RTD.  Zener 
tunneling,  stimulated  formation  of  duons ,  and  the 


self-oscillation  of  the  trapped  hole  charge  provide 
the  mechanism  behind  a  new  high-frequency 
source  introduced  in  this  paper.  We  note  that  the 
RTD  current  peak  occurs  when  the  quantum-well 
energy  level  aligns  with  the  bottom  of  the 
conduction  band  of  the  emitter.  In  some  emitter 
designs,  the  Fermi  level  adjacent  to  the  barrier 
increases  as  this  alignment  is  approached,  leading 
to  a  high-transconductance  with  the  high-field 
domain  acting  as  a  self-gate.  Thus,  the  transcon¬ 
ductance  of  this  ‘self-gated’-transistor  oscillator 
can  be  made  large,  yielding  a  novel  high-frequency 
source  with  a  usable  power. 

The  ‘chemical  kinetics’  modeling  used  above  to 
analyze  the  device  operation  adds  to  the  store  of 
analytical  tools  for  characterizing  complex  de¬ 
vices.  However,  large-scale  time-dependent  simu¬ 
lation  is  needed  for  further  research  and  design 
optimization.  Besides  potential  applications  in 
communications,  defect  engineering  would  also 
make  it  as  a  triggering  element  in  semiconductor 
lasers. 
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Tunneling  between  vertically  stacked  quantum  wires  has  been  investigated.  The  wires 
are  assumed  to  have  the  dimension  perpendicular  to  the  tunneling  barrier  much  smaller 
than  the  other  transverse  dimension,  so  that  only  the  lowest  mode  in  such  direction  is  to 
be  taken  into  account,  while  many  modes  in  the  other  direction  are  filled.  A  model  with 
hard-wall  confinement  has  been  used  for  the  investigation  of  the  transport  problem,  and 
the  tunneling  conductance  has  been  computed,  via  a  recursive  Green’s  functions 
procedure. 


Keywords;  Quantum  wire,  tunneling,  stacked  2DEGs,  green’s  functions 


1.  INTRODUCTION 

The  possibility  of  fabricating  heterostructures 
containing  two  weakly  coupled  and  independently 
contacted  2DEGs  (two-dimensional  electron 
gases)  has  originated  several  proposals  for  possible 
devices  exploiting  controlled  tunneling  between 
such  2DEGs.  The  basic  idea,  proposed  by 
Sakakibara  et  ai[  1],  consists  in  fabricating  two 
stacked  2DEGs  with  different  thicknesses,  so  that 
in  normal  conditions  the  ground  states  for  the  two 
quantum  wells  are  not  aligned.  By  applying  a 
potential  to  a  metallic  gate  obtained  on  top  of  the 
heterostructure,  it  is  possible  to  alter  the  potential 
landscape  and  to  line  up  the  two  ground  states,  so 
that  tunneling  will  be  possible.  If  we  want  to 


observe  interference  effects  dependent  on  the 
length  of  the  metallic  gate,  the  situation  is, 
however,  more  complex  than  what  is  discussed  in 
Ref.  [1].  While  in  a  quantum  wire  the  number  of 
transverse  modes  is  limited  or  we  can  even  achieve 
monomode  propagation,  in  a  2DEG  the  number 
of  transverse  modes  is  not  limited  and  the 
continuous  distribution  of  wavevectors  may  tend 
to  wash  out  interference  effects.  In  particular,  the 
dependence  of  the  tunneling  conductance  on  the 
gate  length  will  differ  from  that  reported  in  Ref.  [1] 
whenever  the  number  of  propagating  modes  is 
significantly  larger  than  one. 

This  happens  because  each  pair  of  modes 
contributing  to  conductance  will  be  associated 
with  a  different  transfer  length,  corresponding  to 
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the  length  over  which  complete  transfer  from  one 
waveguide  to  the  other  occurs.  Therefore,  as  a 
function  of  the  coupling  length,  the  tunneling 
conductance  will  not  exhibit  a  periodic  behavior, 
but,  rather,  a  superposition  of  different  periodi¬ 
cities.  In  the  limiting  case  of  two  laterally 
unconfined  2DEGs  the  number  of  occupied 
transverse  subbands  is  practically  infinite  and 
any  periodic  pattern  as  a  function  of  gate  length 
should  disappear.  However,  if  the  transverse 
dimension  is  limited  by  etching,  thus  defining 
two  stacked  quantum  wires  instead  of  two  stacked 
2DEGs,  it  is  possible  to  limit  the  number  of 
transverse  modes  contributing  to  conduction,  and 
therefore  to  preserve  some  sort  of  periodic  or 
quasi-periodic  behavior.  This  is  the  case  investi¬ 
gated  in  the  present  paper:  we  study  the  tunneling 
conductance  between  two  quantum  wires  with 
rectangular  cross-section  (with  a  large  aspect  ratio, 
greater  than  10)  separated  by  a  barrier  along  the 
longer  side. 


2.  MODEL 

Our  model  consists  of  two  vertically  stacked 
quantum  wires  with  a  rectangular  cross  section, 
as  shown  in  Figure  1.  At  the  outer  ends,  the  two 
quantum  wires  are  separated  by  hard  walls  and  are 
therefore  completely  independent,  so  that  the 
electron  wave  functions  are  completely  localized 


Top  wire 


FIGURE  1  Sketch  of  the  stacked  quantum  wire  structure. 


in  each  of  them,  while  in  the  “coupling  region” 
they  are  separated  by  a  tunneling  barrier  of  finite 
height.  The  central  section  of  the  coupling  region 
corresponds  to  the  area  covered  by  the  metallic 
gate  used  to  shift  the  potentials  of  the  two 
quantum  wires.  This  section  is  connected  to  the 
outer  sections  by  means  of  a  region  with  graded 
potentials,  with  a  length  corresponding  to  that 
over  which  the  gate  potential  is  effectively 
screened.  In  order  to  reduce  the  computer  time 
used  for  the  simulations  and  considering  that  most 
of  the  relevant  physics  is  preserved  unchanged,  we 
have  included  only  one  intermediate  section 
between  the  central  region  and  the  outer  ends, 
with  a  potential  landscape  corresponding  to  an 
average  of  those  relative  to  the  adjacent  sections. 
The  gate  bias  is  assumed  to  tilt  the  potential 
linearly  along  the  vertical  direction.  In  order  to 
solve  the  transport  problem  along  this  structure, 
we  use  the  quasi-3D  approach  that  we  developed 
for  the  investigation  of  electron  waveguide  cou¬ 
plers  [2,  3]:  the  structure  is  divided,  along  the 
longitudinal  direction,  into  a  number  of  slices, 
each  of  which  has  a  constant  potential  profile.  The 
Schrodinger  equation  is  solved  within  each  slice 
and  the  eigenvalues  and  eigenfunctions  thus  ob¬ 
tained  are  used  to  evaluate  the  tunneling  conduc¬ 
tance  by  means  of  a  recursive  Green’s  function 
approach  [2], 

In  the  case  of  the  present  structure  we  are  faced 
with  an  additional  problem:  the  effective  mass  in 
the  region  corresponding  to  the  barrier  dividing 
the  two  quantum  wells  is  different  from  that  in  the 
wells  themselves.  Let  us  assume  a  reference  system 
with  y  along  the  vertical  direction,  perpendicular 
to  the  tunneling  barrier,  z  along  the  longitudinal 
direction,  parallel  to  current  flow,  and  x  along  the 
horizontal  transverse  direction  (see  Fig.  1).  The 
Schrodinger  equation  in  each  slice  reads: 

-n1  ay  _  i  dip' 

2m*  {y)  dx2  2  dy  m*{y)  dy 

ti1  d2ip  ,  ^ 
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We  can  try  a  solution  of  the  type  ip  =  x  (*,  y) 
exp  (ikz)  (more  generally  it  would  be  ip  =  x  (*>  k) 
[A  exp  (ikz)  +  B  exp  (—ikz)]):  the  transverse  equa¬ 
tion  (on  the  x-y  plane)  can  then  be  written  as 


-ft2  d2x(*,.y) 

2m*  (y)  dx2 


tf_d_ 

2  dy 


1  dx(x,y)  1 


m*  (_y)  dy 

-{-k2)x{x,y) 


2m*  (j>) 

+  V(x,y)x(x,y)  =Ex(x,y). 


(2) 


If  we  define  A m*(y)  =  m*(y)  -  m*Q,  where  m*0  is 
the  effective  mass  in  the  quantum  wells,  and 
Ez  =  h2/(lml)k2,  we  get: 


-7z2  d2x{x,y)  "ft2  d 


2m*  (y)  dx 2 


1  dx(x,y) 


-E 


2  dy  [ m*(y) 
A  m*(y) 


mZ  +  Am*(y) 


dy  J 
x(x,y) 


+  V(x,y)x{x,y )  =  (E-Ez)x(x,y). 

(3) 


We  notice  that  if  the  term  EzAm*(y)/((m*0) 
+Am*(y))  can  be  neglected  compared  to  V(x,  y), 
this  is  just  a  standard  2D  Schrodinger  equation 
with  variable  effective  mass.  Such  term  is  nonzero 
only  within  the  barrier,  where  Am*(y)/(m*0+ 
Am*(y))  is,  however,  always  smaller  than  0.5 
(0.44  for  GalnAs/AlInAs  and  0.264  for  GaAs/ 
AlGaAs).  Furthermore  it  is  clearly  Ez  <  Et,  where 
Ef  is  the  Fermi  level.  Thus,  as  long  as  we  choose 
values  of  the  Fermi  level  that  are  less  than  one  half 
of  the  barrier  height,  the  transverse  problem  can 
be  reduced  to  that  of  solving  a  conventional  2D 
Schrodinger  equation,  without  introducing  too 
large  an  error. 


3.  NUMERICAL  PROCEDURE 
AND  RESULTS 

The  Schrodinger  equation  in  each  transverse  slice 
is  discretized  on  a  uniform  finite-difference  mesh, 
and  the  eigenvalue  problem  is  solved  with  an 
efficient  iterative  Chebishev-Arnoldi  technique. 


The  typical  size  of  the  mesh  varies  between  20  x 
50  and  20  x  350  points,  and  the  lowest  eigenvalues 
up  to  the  50th  are  evaluated.  The  presence  of  a 
varying  effective  mass  along  the  y  direction  is 
treated  with  the  standard  formalism  ensuring  the 
continuity  of  the  wave  function  and  of  its  normal 
derivative  at  the  interface  [4],  as  shown  in  Eq.  (1). 
The  transmission  coefficients  between  the  two 
waveguides  are  then  computed  with  the  modified 
recursive  Green’s  function  method  [3],  and  finally 
averaged  over  the  derivative  of  the  Fermi  function, 
in  order  to  take  into  account  the  effects  of 
temperature  [5]  (all  of  our  calculations  have  been 
performed  for  a  temperature  of  1.6  K). 

We  have  taken  into  consideration  the  parameter 
values  for  two  different  material  systems:  a) 
GalnAs/AlInAs  and  b)  GaAs/AlGaAs.  For  mate¬ 
rial  system  a)  we  have  considered  an  effective  mass 
m*= 0.042  m0  in  GalnAs  and  m’  +  Am*  =  0.075 
mo  in  the  barrier  region,  m0  being  the  mass  of  the 
free  electron.  A  value  of  0.5  eV  has  been  chosen  for 
the  conduction  band  discontinuity.  For  material 
system  b)  an  effective  mass  m*  —  0.067  mo  has  been 
considered  in  GaAs  and  m*  +  Am*=  0.091  mo  in 
the  barrier  region.  The  conduction  band  disconti¬ 
nuity  has  been  assumed  to  be  0.3  eV. 

In  all  the  cases  we  have  studied,  the  thicknesses 
of  the  heterostructure  layers  have  been  kept 
constant,  while  the  width  along  x  of  the  quantum 
wires  has  been  varied.  The  thickness  of  the  upper 
wire  along  y  is  9  nm,  that  of  the  lower  wire  7  nm 
and  that  of  the  barrier  4nm.  All  the  energies  are 
indicated  with  reference  to  that  of  the  lowest  edge 
of  the  bottom  quantum  wire. 

In  Figure  2  we  report  the  tunneling  conductance 
between  the  input  of  one  wire  and  the  output  of 
the  other,  in  units  of  the  conductance  quantum 
G0=  2  e2lh,  for  a  GalnAs/AlInAs  structure  with 
widths  of  50  nm  (dotted  line),  100  nm  (dashed  line) 
and  200  nm  (solid  line).  The  tunneling  conduc¬ 
tance  is  plotted  versus  the  potential  shift  applied  to 
the  upper  edge  of  the  top  quantum  wire  (the 
potential,  as  discussed  before,  is  assumed  to  vary 
linearly  in  the  vertical  direction).  We  have 
considered  only  positive  values  of  such  shift,  since 
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Applied  potential  shift  (meV) 


FIGURE  2  Tunneling  conductance  (in  units  of  the  conduc- 
tance  quantum)  for  the  GalnAs/AlInAs  structure  versus  the 
applied  potential  shift,  with  wire  widths  of  50  nm  (dotted  line), 
100  nm  (dashed  line)  and  200  nm  (solid  line).  The  Fermi  level 
has  been  set  at  0.24  eV. 


the  top  wire  is  thicker  than  the  bottom  one  and 
thus  it  needs  to  be  raised  to  a  higher  potential  in 
order  to  line  up  the  energy  levels  in  the  two  wires. 
The  coupling  length  is  200  nm. 

For  the  results  shown  in  this  figure,  a  Fermi 
level  of  0.24  eV  has  been  considered  and  up  to  50 
transverse  modes  have  been  included  in  the 
calculation.  A  clear  peak,  reaching  a  maximum 
for  a  bias  of  approximately  198meV,  is  visible, 
corresponding  to  the  line-up  of  the  energy  levels 
associated  with  the  subbands  in  the  two  wires.  At 
this  point  it  is  useful  to  draw  a  comparison 
between  the  situation  for  devices  in  which  the 
coupled  wires  are  obtained  by  lateral  confine¬ 
ment  [6]  in  the  same  2DEG  and  the  present 
situation.  In  the  former  case  all  the  subbands  line 
up  for  the  same  bias  condition  only  if  the  two  wires 
are  identical  and,  even  in  such  a  case,  only  the 
highest  ones  give  a  significant  contribution  to  the 
tunneling  current,  because  they  have  the  largest 
transverse  wave  vectors,  in  the  direction  orthogo¬ 
nal  to  the  potential  barrier.  In  the  latter  case, 
instead,  the  spacing  between  the  subbands  is 
determined  by  the  width  of  the  wire  along  x, 
which  is  the  same  for  both  wires,  and  therefore  all 
of  them  line  up  at  the  same  time.  More  im¬ 
portantly,  all  the  modes  give  a  similar  contribution 


to  the  tunneling  current,  because  the  wave  vector 
orthogonal  to  the  tunneling  barrier  is  the  same  for 
all  of  them.  The  result  is  that  if  the  width  of  the 
wire  increases,  the  height  of  the  conductance  peak 
also  increases,  due  to  the  contribution  of  a  larger 
number  of  modes,  but  its  shape  or  position  does 
not  vary  significantly. 

In  Figure  3  we  report  the  tunneling  conductance 
results  for  the  GaAs/AlGaAs  material  systems,  for 
structures  50  nm  (thin  line)  and  100  nm  (thick  line) 
wide.  Also  in  this  case  the  coupling  length 
(corresponding  to  the  length  of  the  biasing  gate) 
is  200  nm,  while  the  Fermi  level  has  been  set  at 
0.15eV.  The  peak  is  located  approximately  at 
95  meV:  a  value  much  lower  than  that  in  Figure  2, 
because  of  the  significant  difference  in  the  effective 
mass  value. 

We  have  then  investigated  the  behavior  of  the 
tunneling  conductance  as  a  function  of  the 
coupling  length.  If  only  one  mode  were  contribut¬ 
ing  to  the  tunneling  current,  we  would  expect  the 
sinusoidal  oscillation  predicted  also  by  the  coupled 
mode  theory  used  for  the  analysis  of  optical 
waveguides;  in  this  case,  however,  there  are  many 
contributing  modes,  each  of  which  is  associated 
with  a  different  transfer  length.  As  a  result,  the 
tunneling  conductance  exhibits  some  complex 
structure,  depending  on  the  number  of  modes 


Applied  potential  shift  (meV) 


FIGURE  3  Tunneling  conductance  (in  units  of  the  conduc¬ 
tance  quantum)  for  the  GaAs/AlGaAs  structure  versus  the 
applied  potential  shift,  with  wire  widths  of  50  nm  (thin  line)  and 
lOOnm  (thick  line).  The  Fermi  level  has  been  set  at  0.15eV. 
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and  on  the  relationships  between  the  related 
transfer  lengths. 

In  Figure  4  the  tunneling  conductance  is  plotted 
versus  the  length  of  the  coupling  region,  for  the 
GalnAs/AlInAs  material  system  and  for  three 
different  choices  of  the  parameters.  Two  curves 
have  been  obtained  for  a  width  of  50  nm  and  a 
Fermi  level  of  0.22  eV  (dashed  line)  and  of  0.24  eV 
(thin  line).  The  third  curve  (thick  line)  is  for  a 
width  of  200  nm  and  a  Fermi  level  of  0.24  eV.  In  all 
three  cases  the  applied  potential  shift  is  196  meV. 
A  rather  complex  structure  can  be  observed,  due 
to  the  simultaneous  presence  of  several  pairs  of 
coupled  modes,  each  of  which  is  characterized  by  a 
different  transfer  length.  For  the  lowest  value  of 
the  Fermi  level  we  have  an  almost  monomode 
propagation  and  the  tunneling  conductance  oscil¬ 
lates  sinusoidally. 

Results  for  the  GaAs/AlGaAs  material  system 
are  reported  in  Figure  5,  where  the  thick  curve  and 
the  thin  curve  have  been  obtained  for  a  Fermi  level 
of  0 . 1 6  eV  and  wire  widths  of  200  nm  and  of  50  nm, 
respectively.  The  dashed  curve  is  instead  for  a  wire 
width  of  50  nm  and  a  Fermi  level  of  0.13eV.  The 
applied  potential  shift  is  100.5  meV.  Also  for  this 
material  system  we  observe  a  transition  from  a 


FIGURE  4  Tunneling  conductance  (in  units  of  the  conduc¬ 
tance  quantum)  for  the  GalnAs/AlInAs  structure  versus  the 
coupling  length,  with  a  wire  width  of  50  nm  and  a  Fermi  level 
set  at  0.24  eV  (thin  line)  or  at  0.22  eV  (dashed  line).  The  thick 
line  represents  the  results  for  a  wire  width  of  200  nm  and  a 
Fermi  level  set  at  0.24  eV.  For  all  three  cases  the  applied  bias 
potential  is  196  meV. 


FIGURE  5  Tunneling  conductance  (in  units  of  the  conduc¬ 
tance  quantum)  for  the  GaAs/AlGaAs  structure  versus  the 
coupling  length,  with  a  wire  width  of  50  nm  and  a  Fermi  level 
set  at  0.16eV  (thin  line)  or  at  0.13  eV  (dashed  line).  The  thick 
line  represents  the  results  for  a  wire  width  of  200  nm  and  a 
Fermi  level  set  at  0.16eV.  For  all  three  cases  the  applied  bias 
potential  is  100.5  me V. 

quasisinusoidal  dependence  of  the  tunneling  cur¬ 
rent  on  the  coupling  length  (when  just  one  pair  of 
modes  contributes)  to  a  rather  complex  behavior. 

4.  CONCLUSIONS 

We  have  investigated  the  tunneling  conductance 
between  two  stacked  quantum  wires,  as  a  function 
of  an  applied  transverse  electric  field  and  of  the 
coupling  length.  The  conductance  peak  that  is 
observed  when  the  subbands  in  one  wire  line  up 
with  those  in  the  other  wire  has  a  height  depending 
on  the  total  number  of  occupied  subbands.  Thus  a 
relatively  low  “on”  resistance  can  be  achieved, 
which  makes  the  investigated  structure  suitable  for 
operation  as  a  controlled  switch. 
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Electron  transport  properties  of  strained-Si  on  relaxed  Sii_xGex  channel  MOSFETs 
have  been  studied  using  a  Monte  Carlo  simulator.  The  steady-  and  non-steady-state 
high-longitudinal  field  transport  regimes  have  been  described  in  detail.  Electron- 
velocity-overshoot  effects  are  studied  in  deep-submicron  strained-Si  MOSFETs,  where 
they  show  an  improvement  over  the  performance  of  their  normal  silicon  counterparts. 
The  impact  of  the  Si  layer  strain  on  the  performance  enhancement  are  described  in 
depth  in  terms  of  microscopic  magnitudes. 

Keywords:  Strained  Si  layer,  electron  velocity  overshoot,  conduction  effective  mass  reduction, 
intervalley  scattering  rate  reduction,  Monte  Carlo  simulation 


1.  INTRODUCTION 

Recently,  both  theoretical  and  experimental  works 
have  shown  important  electron  mobility  enhance¬ 
ment  when  silicon  is  grown  pseudomorphically  on 
relaxed  Si^Ge*  at  different  temperatures.  The 
strain  causes  the  six-fold  degenerate  valleys  of  the 
silicon  conduction  band  minimum  to  split  into  two 
groups:  two  lowered  valleys  with  the  longitudinal 
effective  mass  axis  perpendicular  to  the  interface, 
and  four  raised  valleys  with  the  longitudinal  mass 
axis  parallel  to  the  interface.  This  splitting  reduces 
the  intervalley  phonon  scattering  rate  compared 
with  that  of  unstrained  silicon.  In  addition,  in  the 


lowered  valleys,  which  are  more  populated  in  the 
strained  case,  electrons  show  a  smaller  conduction 
effective  mass  (transverse  mass)  in  transport 
parallel  to  the  interface.  The  combination  of  the 
light  effective  mass  and  reduced  intervalley  scatter¬ 
ing  gives  rise  to  higher  electron  mobility  [1]. 
Moreover,  the  lower  intervalley-scattering  rates 
make  the  energy  relaxation  times  higher,  originat¬ 
ing  important  electron  velocity  overshoot.  These 
advantages  can  be  used  to  improve  MOSFETs 
parameters,  taking  advantage  both  of  the  higher 
carrier  mobility  and  the  higher  electron  velocity 
overshoot,  thus  greatly  improving  short  channel 
MOSFET  transconductance. 
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2.  MONTE  CARLO  SIMULATOR 

We  have  developed  a  strained-Si-«-type  Monte 
Carlo  simulator  by  adapting  a  previous  un- 
strained-Si  one  which  includes  inversion-layer 
quantization  and  a  nonparabolic  band  model  [2]. 
The  quantization  effects  are  included  by  solving 
the  Poisson  equation  coupled  with  the  one¬ 
dimensional  Schrodinger  equation  all  along  the 
channel.  The  value  of  the  conduction  band  offset 
for  the  four-fold  in-plane  bands  over  the  value  for 
the  two-fold  out-of-plane  bands  was  0.67x  eV, 
where  x  was  the  Ge  mole  fraction  [3]  and  the  valley 
shape  was  not  modified  by  the  strain.  The  two- 
dimensional  Poisson  equation  is  solved  through¬ 
out  the  MOSFET  to  account  for  short-channel 
effects.  Phonon,  surface-roughness  and  Coulomb 
scattering  rates  have  been  evaluated  at  each  point 
of  the  channel  [3].  The  electron  energy  for  all  our 
simulations  was  always  under  0.5  eV,  and  so  a 
nonparabolic  simplified  band  structure  can  be  used 
to  accurately  describe  the  Si  band  structure  [4]. 

3.  RESULTS 

We  have  simulated  strained-Si  on  relaxed  Sij„* 
Ge*  long-channel  MOSFETs  in  order  to  obtain 
the  steady-state  electron  velocity  and  energy 
curves  versus  the  longitudinal-electric  field  (see 
Fig.  1).  After  having  fixed  the  transverse-electric 


Longitudinal  Electric  Held  (V/cm)  Longitudinal  Electric  Field  (V/cm) 


FIGURE  1  Steady-state  drift-electron-velocity  curves  (a)  and 
electron-average  energy  (b)  versus  longitudinal-electric  field  for 
Eefr=l.lx  10s  V/cm  at  T  =  300  K.  (A:  x  =  0;  ■:  x=0.2; 
*:x  =  0.4).  The  high  effective  field  chosen  is  typical  of  very 
short  channel  MOSFETs  where  high  doping  profiles  and  thin 
oxides  are  used. 


field  we  increased  the  longitudinal  field  step  by 
step,  recording  the  most  important  transport 
magnitudes  when  the  steady-state  was  reached. 
The  saturation  velocities  obtained  were  almost  the 
same  for  all  Ge  mole  fractions:  1.1  xlO7  cm/s  at 
300  K  and  1.4 xlO7  cm/s  at  77  K,  although  they 
were  a  bit  higher  («l-2%)  for  x  >  0. 

The  energy-  and  momentum-relaxation  times 
can  be  calculated  making  use  of  the  data  shown  in 
Figure  1.  Different  relaxation  times  are  obtained 
as  x  changes.  The  momentum-  and  energy-relaxa¬ 
tion  times,  for  different  Ge  mole  fractions  and 
temperatures  are  shown  in  Figure  2.  The  momen¬ 
tum-relaxation  times  are  smaller  than  the  energy 
ones.  The  difference  between  the  momentum-  and 
energy-relaxation  times  produces  nonlocal  elec¬ 
tron  transport  effects  such  as  electron-velocity 
overshoot  [5].  These  effects  are  expected  to  occur 
on  a  time  scale  shorter  than  the  energy-relaxation 
time.  It  is  foreseeable  that  the  higher  energy- 
relaxation  times  observed  in  Figure  2  as  x  rises 
lead  to  increased  electron  velocity-overshoot  eff¬ 
ects  and  therefore  higher  MOSFET  transconduc¬ 
tances  as  channel  dimensions  are  reduced.  The 
relaxation  times  are  lower  at  low  temperature  than 
at  room  temperature  and  therefore  electron 
velocity  overshoot  effects  are  higher  at  low  temp¬ 
erature  as  it  shown  in  Figure  3. 

We  have  studied  velocity-overshoot  effects  by 
applying  a  sudden  longitudinal-electric  field  of 
2xl05  V/cm  to  a  steady-state  electron  distribution 
achieved  under  the  influence  of  a  longitudinal- 
electric  field  of  1  x  104  V/cm.  The  time  evolution  of 


Electron  Energy  (eV)  Electron  Energy  (eV) 


FIGURE  2  Energy-  and  momentum-relaxation  times  versus 
electron  energy  for  Eefr  =  1.1  xlO5  V/cm  at  r  =  300K  (a)  and 
T  =  11K  (b).  (A:  *  =  0;  ■:  x  =  0.2;  *  :x  =  0.4). 
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FIGURE  3  Transient  overshoot  velocity  with  a  sudden 
application  of  the  field  2xl05  V/cm  at  room  and  low 
temperature  for  £eff  =7.7xl05  V/cm.  Unstrained-silicon  (solid 
line),  x  =  0.2  (short  dashed  line),  x  =  0.4  (long  dashed  line). 


the  electron  velocity  is  shown  in  Figure  3.  It  is 
clear  that  the  time  taken  to  reach  the  steady-state 
velocity  corresponding  to  a  longitudinal-electric 
field  of  2x  105  V/cm  increases  as  x  rises.  This  result 
is  coherent  with  the  energy-relaxation  times  shown 
in  Figure  2.  It  is  clear  that  the  increase  of  the 
energy-relaxation  times  as  x  rises  causes  the  time 
taken  to  reach  the  steady-state  to  be  longer  and 
hence  enhances  the  velocity-overshoot  effects. 

We  have  simulated  several  0.1pm  channel 
length  MOSFETs  for  x  =  0, 0.1, 0.2  and  0.3  (the 
thickness  of  the  strained  Si  layer  is  4.6  nm).  The 
MOSFET  external  bias  was  FDS  =  0 .5  V,  FGs=  1.5 
V,  and  FSb  =  0.  The  velocity  distribution  along  the 
channel  obtained  for  each  Ge  mole  fraction  is 
plotted  in  Figure  4  at  room  temperature.  The 
electron  velocity  is  higher  than  the  saturation 
velocity  for  strained  and  unstrained-Si  channel 
MOSFETs  near  the  drain  edge.  This  effect  is  due 
to  the  high  longitudinal  electric-field  gradient  the 
carriers  face  as  they  travel  toward  the  drain.  This 
gradient  makes  the  electrons  overshoot  the  velo¬ 
city  they  would  have  if  they  were  subject  to  steady- 
state  transport. 
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FIGURE  4  Electron  velocity  versus  channel  position  at 
r  =  300K.  Velocity  distribution  obtained  in  an  0.1  pm  MOS¬ 
FET  with  Fgs  =  1.3  V  and  FDS  =  0.5V  for  x  =  0  (solid  line), 
x=0.1  (long  dashed  line),  x-0.2  (medium  dashed  line),  x  =  0.3 
(short  dashed  line).  Low  longitudinal-electric-field  velocity 
corrected  using  Thomber’s  expression  and  the  longitudinal 
field  distribution  obtained  for  the  previous  MOSFET  with  x=0 
(Squares). 


4.  CONCLUSIONS 

A  Monte  Carlo  simulator  has  been  used  to  study 
the  electron  transport  properties  of  strained-Si  on 
relaxed  Sii_^Gex  channel  MOSFETs  and  the 
performance  improvement  of  these  devices  at 
high-longitudinal  fields.  Similar  saturation  velo¬ 
cities  are  obtained  no  matter  the  value  of  the  Ge 
mole  fraction,  however  the  electron  velocity  over¬ 
shoot  effects  increase  as  the  Ge  mole  fraction  rises. 
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Electron  transport  properties  in  SiC  quantized  inversion  layers  have  been  studied  by 
means  of  a  Monte  Carlo  procedure.  It  has  been  observed  that  the  contribution  of  polar- 
optical  phonon  scattering  produces  a  significant  influence  of  the  effective-electric  field 
on  the  high  longitudinal  field  transport  regime,  this  being  the  main  difference  of  SiC 
with  respect  to  standard  Si  inversion  layers.  The  energy-  and  momentum-relaxation 
times  have  been  calculated  and  the  results  suggest  that  electron  velocity  overshoot 
effects  are  less  important  than  in  Si  MOSFETs.  The  electron  mobility  is  not  very 
different  from  their  silicon  counterparts,  but  the  saturation  velocity  is  higher. 


Keywords:  Silicon  carbide,  inversion  layer,  high  field  transport  properties,  electron  mobility,  polar- 
optical  phonon  scattering 


1.  INTRODUCTION 

Cubic  silicon  carbide  (/?- SiC)  is  thought  to  be  a 
material  well-suited  for  electronic  devices  operat¬ 
ing  under  harsh  conditions  such  as  high  tempera¬ 
ture,  high  power,  and  high  frequency  due  to  its 
high  saturated  drift  velocity,  wide  band  gap,  and 
high  thermal  conductivity  [1-2].  The  fabrication 
of  high  temperature  metal-oxide-semiconductor 
field-effect  transistors  (MOSFETs),  junction  field- 
effect  transistors,  and  metal-semiconductor  field- 
effect  transistors  has  already  been  demonstrated 

[3]. 


As  a  consequence  of  this  interest,  theoretical 
analyses,  and  simulations  on  the  electronic  proper¬ 
ties  of  silicon  carbide  have  begun  to  appear  in  the 
last  few  years.  However,  up  to  now,  numerical 
simulations  related  to  the  electron  transport  prop¬ 
erties  of  silicon  carbide  inversion  layers  have  not 
considered  electron  quantization.  The  wide  experi¬ 
ence  obtained  in  silicon  and  other  semiconductor 
material  inversion  layers  over  many  years  suggests 
that  for  a  complete  evaluation  of  the  performance 
potential  of  silicon  carbide  MOSFETs,  a  detailed 
analysis  of  the  transport  properties  of  quantized 
silicon  carbide  inversion  layers  is  essential. 
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2.  MONTE  CARLO  SIMULATOR 

Electron  transport  properties  such  as  electron 
velocity  and  mobility  at  room  arid  higher  tem¬ 
peratures  were  studied  by  using  a  Monte  Carlo 
simulator  we  have  developed  [4,  5].  The  position  of 
the  subband  minima  and  the  electron  concentra¬ 
tion  of  each  of  them  were  obtained  by  the 
self-consistent  solution  of  the  Poisson  and  Schroe- 
dinger  equations  for  each  value  of  the  effective- 
transverse-electric  field.  A  non-parabolic  band 
model  was  assumed  taking  a  =  0.323eV_1  [2], 
which  limits  our  study  to  low-electron  energies 
(below  leV).  We  have  allowed  the  electron  to 
travel  in  six  subbands  and  to  move  between  them. 
Acoustic,  intervalley-optical  and  polar-optical 
phonons,  surface-roughness  and  Coulomb  scatter¬ 
ing  due  to  both  bulk  impurities  and  interface- 
trapped  and  oxide-trapped  charge  have  been  taken 
into  account. 


3.  RESULTS 

We  have  simulated  a  SiC  MOSFET  with  an  oxide 

o 

thickness  of  150  A  and  a  substrate  impurity 
concentration  NA=  1  xlO16 cm”3.  We  have  paid 
special  attention  to  the  role  that  polar-optical 
phonon  scattering  plays  in  the  total  scattering 
rates.  We  have  studied  the  ratio  R  between  the 
total  scattering  rates  with  and  without  the  inclu¬ 
sion  of  polar-optical  phonon  scattering  mechan¬ 
isms.  It  has  been  plotted  in  Figure  1  for  the  lowest 
energy  subbands  versus  electron  kinetic  energy.  As 
can  be  observed,  the  lower  the  effective-electric 
field  the  higher  the  ratio  R  for  a  wide  range  of 
electron  kinetic  energy.  Therefore,  the  energy 
relaxation  times  are  dominated  by  polar-optical 
phonon  scattering  at  low  effective-electric  fields. 

We  have  simulated  long-channel  /3-SiC  MOS- 
FETs  to  obtain  velocity  curves  versus  longitudi¬ 
nal-electric  field  (Fig.  2).  As  can  be  seen,  the  drift 
velocity  curves  reach  a  maximum  and  then  tend  to 
decrease  as  the  drift-electric  field  increases.  There 
is  a  strong  dependence  of  the  velocity  curves  on  the 
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FIGURE  1  Ratio  R  between  the  total  scattering  rates  taking 
and  not  taking  into  account  polar-optical  phonon  scattering  for 
the  first  primed  (0)  and  unprimed  (O')  subbands  at  T=  300  K  for 
two  different  effective-electric  fields:  £EFF=105  V/cm  (solid 
line)  and  £EFF=8xl05  V/cm  (dashed  line). 
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FIGURE  2  Homogeneous  steady-state  electron  drift  velocity 
curves  in  /?- SiC  at  room  temperature  for  different  transverse- 
effective  fields,  (solid:  isEFF-lxl05  V/cm,  short  dashed: 
^eff-SxIO5  V/cm,  long  dashed:  Eeff=  1  xIO6  V/cm)  for 
r=300K. 

effective-electric  field,  this  fact  is  connected  with 
the  results  of  Figure  1  (the  lower  the  transverse 
field  the  higher  the  saturation  velocity).  The 
saturation  velocity  is  higher  than  in  silicon,  which 
is  one  of  the  main  reasons  explaining  the  potential 
importance  of  silicon  carbide  as  an  electron  device 
material,  in  addition  to  its  wide  band  gap. 
Unfortunately  there  are  no  available  experimental 
data  to  compare  with  our  simulation  data. 

Making  use  of  the  steady-state  velocity  and 
energy  plots  we  have  calculated  (Fig.  3)  the  energy- 
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FIGURE  3  Momentum-  and  energy-relaxation  times  versus 
electron  energy  for  a  j3- SiC  MOSFET  at  T=  300  K  (solid  line) 
£eff=105  V/cm,  (short  dashed  line)  Eeff  =  6.7x105  V/cm, 
(long  dashed  line)  Eeff  =  106  V/cm. 


and  momentum-relaxation  times  at  room  tem¬ 
perature  by  means  of  the  analytical  expressions 
given  in  Ref.  6.  This  calculation  has  been  done  by 
using  an  average  effective  mass  taking  into 
consideration  the  fraction  of  electrons  in  the 
primed  and  unprimed  subbands  for  each  effective 
field.  The  SiC  relaxation  times  are  approximately 
one  order  of  magnitude  lower  than  the  silicon 
ones,  implying  that  the  electron-velocity-overshoot 
effects  are  going  to  be  less  important  than  in 
silicon  MOSFETs.  At  this  point,  it  is  worth 
highlighting  the  importance  of  the  effective-electric 
field  on  the  high  longitudinal-electric-field  trans¬ 
port  properties  in  SiC  MOSFETs.  There  exists  a 
strong  dependence  of  the  saturation  velocity 
(homogeneous  transport  in  the  channel)  and  of 
the  relaxation  times  (inhomogeneous  transport  in 
the  channel,  velocity  overshoot)  on  the  effective- 
electric  field,  which  is  more  important  than  in 
standard  silicon  MOSFETs. 

We  have  simulated  and  plotted  in  Figure  4  the 
mobility  curves  versus  the  transverse-effective  field 
at  room  temperature  assuming  (a)  only  phonon 
scattering  (solid  line),  (b)  phonon  and  surface- 
roughness  scattering  (solid  line  and  closed 
squares),  (c)  Coulomb  scattering  due  only  to 
substrate  doping  impurities  (solid  line  and  trian¬ 


FIGURE  4  Electron  mobility  curves  in  /3-SiC  versus  trans¬ 
verse  effective  field  at  room  temperature:  (a)  only  phonon 
scattering  (solid  line),  (b)  phonon  and  surface-roughness 
scattering  (solid  line  and  full  squares),  (c)  Coulomb  scattering 
due  only  to  substrate  doping  impurities  (solid  line  and  full 
triangles),  and  (d)  assuming,  in  addition  to  what  is  included  in 
(c),  a  typical  interface  charged  layer  of  Nit  —  1  x  101 1  cm-2  (solid 
line  and  full  circles).  Electron  mobility  curve  in  Si  (dashed  line 
and  full  triangles). 


gles),  and  (d)  assuming,  in  addition,  a  typical 
interface  charged  layer  of  Nit=  1  xlO 11  cm~2 
(closed  circles).  A  mobility  curve  for  a  Si 
MOSFET  corresponding  to  case  (c)  is  included 
in  dashed  line  and  triangles.  When  only  phonon 
scattering  is  taken  into  account,  the  electron 
mobility  at  low  transverse-electric  fields  ap¬ 
proaches  the  SiC  bulk-mobility  value  (1000  cm2/ 
Vs).  The  reduction  of  the  average  distance  between 
the  inversion  charge  and  the  oxide  interface 
increases  surface-roughness  and  Coulomb  scatter¬ 
ing.  In  this  respect,  surface-roughness,  Coulomb, 
and  phonon-scattering  probabilities  (the  last  one  is 
higher  in  SiC  due  to  the  additional  contribution  of 
polar-optical  phonons),  which  are  the  three  main 
scattering  mechanisms  that  limit  electron  mobility 
in  the  channel  of  a  MOSFET,  are  higher  in  SiC 
than  in  Si.  It  is  clear  that  SiC  MOSFETs  are  by  no 
means  candidates  to  substitute  conventional  Si 
ones  at  room  temperature.  However,  at  high 
temperature,  where  Si  MOSFETs  can  no  operate 
well,  SiC  devices  play  their  major  role. 
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4.  CONCLUSIONS 


Electron  transport  properties  have  been  evaluated 
in  SiC  quantized  inversion  layers.  It  has  been 
shown  that  the  influence  of  the  effective-electric 
field  on  the  high  longitudinal  field  transport 
regime  is  significant,  mainly  due  to  the  contribu¬ 
tion  of  polar-optical  phonon  scattering  which  is 
essential  at  low  effective  fields  for  low  and  room 
temperatures.  The  electron-velocity-overshoot  ef¬ 
fects  are  less  important  than  in  Si  MOSFETs.  The 
electron  mobility  is  not  too  different  from  their 
silicon  counterparts,  but  the  saturation  velocity  is 
nevertheless  higher.  So,  this  transport  feature,  in 
addition  to  a  wide  band  gap,  and  high  thermal 
conductivity,  make  SiC  a  promising  candidate  for 
high-temperature/high-power  applications. 
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Determining  the  Dependence  of  the  Electron  Mobility 
on  the  Longitudinal-Electric  Field  in  MOSFETs 
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A  new  experimental  method  for  determining  the  dependence  of  the  electron  mobility  on 
the  longitudinal-electric  field  has  been  developed.  The  development,  validation  and 
explanation  of  this  new  method  has  been  carefully  carried  out.  We  have  applied  this 
procedure  to  standard  submicron  MOSFETs  and  after  having  obtained  the  mobility 
dependence  on  both  the  transverse-  and  longitudinal-electric  fields  we  reproduced  the 
experimental  output  curves.  The  saturation  velocity  has  also  been  calculated  using  the 
mobility  curves  obtained  by  this  new  method. 


Keywords:  Electron  mobility,  longitudinal-electric  field,  saturation  velocity,  output  curves,  drift- 
diffusion  simulation 


1.  INTRODUCTION 

The  electron-mobility  dependence  on  the  longi¬ 
tudinal-electric  field,  oriented  along  the  channel, 
has  been  studied  experimentally.  A  “closed-loop” 
investigation  has  been  performed  on  the  validity 
and  accuracy  of  a  new  method  of  determining  the 
reduction  of  the  channel  mobility  connected  to  the 
increase  of  the  longitudinal  field.  This  method  is 
applicable  to  standard  transistors.  An  estimation  of 
the  saturation  electron  velocity  in  the  channel  has 
been  accomplished  comparing  a  widely  used  ex¬ 
pression  [1].  The  data  obtained  with  this  procedure 


can  be  used  to  develop  local  models  to  account  for 
the  mobility  dependence  on  the  longitudinal-electric 
field  in  drift-diffusion  simulators. 


2.  DESCRIPTION  OF  THE  METHOD 

The  basic  concept  of  idea  of  this  method  is  the 
comparison  of  experimental  drain  current  versus 
drain  voltage  curves  I^-Vds,  measured  in  short- 
channel  transistors  to  the  results  produced  by  a 
two-dimensional  simulator  [2,  3]  in  which  an 
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accurate  low-field  mobility  is  used  and  the  mobility 
reduction  caused  by  the  longitudinal-electric  field 
is  included.  The  total  channel  length  L  is  divided 
in  N  subchannels  with  lengths  Lh  i—  1,  , 

respectively.  Instead  of  fixing  the  Lt  values,  the 
channel  potential  values  at  the  ends  of  the  i- th 
subchannel,  K,-_  i  and  Vh  are  defined  as  the 
separation  between  the  quasifermi  levels  at  these 
points,  V0=  F^and  VN  =  VD  being  the  potentials 
at  the  total  channel  ends.  Therefore,  the  separation 
into  subchannels  is  obtained  by  dividing  VD~VS 
into  very  small  intervals  and  fixing  the  channel 
potential  at  the  ends  of  each  subchannel.  The 
number  of  subchannels  is  high  enough  and  the 
difference  Vj—V^i  in  each  subchannel  is  low 
enough  that  the  longitudinal-electric  field  in  each 
subchannel  can  be  considered  to  be  uniform.  The 
application  of  the  method  is  as  follows: 

1)  Starting  from  a  very  small  VDS,  for  which  one 
subchannel  is  enough,  the  gate-to-source  vol¬ 
tage  is  varied  and  the  low-field  mobility  as  well 
as  its  dependence  on  the  transverse-electric  field 
is  obtained. 

2)  VDS  is  increased  in  very  small  steps  and  the 
experimental  result  is  compared  to  the  simu¬ 
lated  result  calculated  by  using  the  low-field 
mobility  obtained  in  point  (1).  New  subchan¬ 
nels  are  introduced  if  necessary.  When  the 
experimental  and  computed  currents  deviate 
more  than  a  prefixed  (also  very  small)  limit,  61 , 
a  mobility  reduction  is  apparent  at  the  sub¬ 
channel  placed  at  the  drain  end  of  the  channel. 
Then,  the  mobility  in  this  subchannel  is  multi¬ 
plied  by  a  factor  of  A  <  1  in  order  to  reproduce 
the  experimental  current  again.  The  values  for 
the  transverse-  and  longitudinal-electric  fields 
in  this  subchannel  are  introduced  in  an  array: 
^trans  (1)  and  EXong  (1),  respectively. 

3)  VDS  is  increased  again.  In  all  the  subchannels 
for  which  the  longitudinal-electric  field  is 
greater  than  E\ong  (1),  the  low-field  mobility 
is  multiplied  by  A  and  the  current  thus  obtained 
is  compared  to  the  experimental  one.  When  the 
deviation  between  the  computed  and  experi¬ 


mental  currents  is  again  greater  than  61,  a 
further  reduction  is  assumed  in  the  drain  end  of 
the  channel  and  the  mobility  is  once  more 
multiplied  by  A,  thus  obtaining  a  value  equal  to 
the  low-mobility  multiplied  by  A2.  The  trans¬ 
verse-  and  longitudinal-electric  fields  in  this 
subchannel  are  introduced  as  a  new  element  of 
the  array:  EtTans  (2)  and  E\ong  (2),  respectively. 

At  the  end  of  this  process,  the  mobility  values 
for  a  set  of  pairs  (£trans>  ^long)  is  obtained.  This  is 
schematized  with  the  vertical  solid  line  plotted  in 
Figure  1. 


3.  VALIDATION  PROCEDURE 

Once  we  developed  the  algorithm  to  obtain  the 
dependence  of  the  mobility  on  the  longitudinal- 
electric  field  we  proceeded  to  validate  it.  To  do  so, 
we  simulated  several  output  curves,  using  our 
drift-diffusion  simulator  for  a  MOSFET  and  a 
supposed  mobility  dependence,  after  applying  our 
method  we  were  able  to  reproduce  this  depen¬ 
dence,  confirming  the  correctness  of  the  procedure. 
(Fig.  2). 


FIGURE  1  Tables  (£t rans(i>  ^iong(o)  used  to  account  for  the 
mobility  reduction  as  the  longitudinal  field  rises.  They 
correspond  to  four  different  output  curves.  The  solid  line 
represents  a  common  chosen  effective  field  for  all  the  tables. 
This  value  will  be  employed  to  obtain  the  mobility  versus 
longitudinal  field  curves. 
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FIGURE  2  Mobility  curves  versus  longitudinal-electric  field 
obtained  with  Equation  1  to  simulate  output  curves  and 
validate  this  method.  The  vSat  chosen  was  vSat  =  1.1  xlO7 
cm/s,  (3  =  1  (solid  line),  (3  =  1.5  (long  dashed  line),  (3  -  2 
(short  dashed  line).  The  results  of  our  method  are  plotted  in 
squares  for  /?  =  1,  triangles  for  (3  =  1.5  and  stars  for  (3  =  2. 


4.  EXPERIMENTAL  RESULTS 
AND  DISCUSSION 

After  having  validated  the  accuracy  of  our 
method,  we  applied  it  to  Si  MOSFETs  fabricated 
by  Lucent  Technologies  with  LEFf  =  0.6  pm,  W^ff 
=  20|xm,  Tox  =  157  A,  NA  =  1.2  x  1017cm-3  and 
/?sd  =  130  n.  We  have  applied  our  technique  to  a 
set  of  measured  output  curves  with  61  =  10  3  mA 
and  A  =  0.96.  The  results  obtained  can  be  observed 
in  Figure  3,  where  the  mobility  versus  effective  and 
longitudinal  field  surface  is  shown.  Expression  1  was 
used  to  obtain  the  /3  parameter  and  the  saturation 
velocity  vSat- 
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The  values  that  gave  a  better  fit  were  ft  =  1  and 
vsat  —  1.1  xlO7 cm/s.  These  values  were  indepen¬ 
dent  of  the  effective  field,  confirming  the  result  by 
Modelli  and  Manzini  [4].  Using  Thornber’s 
expression  with  these  parameters  we  have  tried  to 


FIGURE  3  Experimental  electron  mobility  as  a  function  of 
the  effective-  and  longitudinal-electric  fields  at  room 
temperature. 


reproduce  the  experimental  output  characteristics. 
The  result  is  shown  in  Figure  4. 

The  data  provided  by  this  new  procedure  are 
very  useful  in  order  to  obtain  local  mobility 
models  for  drift-diffusion  simulators,  where  the 
dependence  of  the  longitudinal-electric  field  is 
included.  The  influence  of  the  series  resistance 
and  its  bias  dependence  can  be  taken  into  account 
easily.  It  is  also  possible  to  assess  the  influence  of 
electron  velocity  overshoot  effects  in  very  short 
channel  MOSFETs  by  using  a  augmented  mobility 
model  [5]. 


FIGURE  4  Experimental  and  simulated  output  curves  at  T— 
300  K  for  a  LE ff  —  0.6  pm  MOSFET.  The  simulated  curves 
have  been  obtained  by  using  the  mobility  shown  in  Figure  3, 
whose  dependency  on  the  longitudinal-electric  field  was 
obtained  with  this  new  method. 
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5.  CONCLUSIONS 

A  new  experimental  method  for  determining  the 
dependence  of  the  electron  mobility  on  the  longi¬ 
tudinal-electric  field  has  been  developed,  validated 
and  applied  to  standard  submicron  MOSFETs. 
This  method  can  be  easily  implemented  in  a  2D 
drift-diffusion  simulator.  All  the  different  depen¬ 
dencies  of  the  electron  mobility  on  the  longi¬ 
tudinal-electric  field  can  be  taken  into  account  at 
the  same  time  by  means  of  this  method.  A 
saturation-velocity  value  of  1.1  xlO7  cm/s  was 
calculated  by  comparing  the  mobility  curves  to 
Thornber’s  expression. 
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Quantum  Distribution-function  Transport  Equations 
in  Non-normal  Systems  and  in  Ultra-fast  Dynamics 
of  Optically-excited  Semiconductors 
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The  derivation  of  the  quantum  distribution-function  transport  equations  combines  the 
Liouvillian  super-Green’s  function  technique  and  the  lattice  Weyl-Wigner  formulation 
of  the  quantum  theory  of  solids.  A  generating  super-functional  is  constructed  which 
allows  an  algebraic  and  straightforward  application  of  quantum  field-theoretical 
techniques  in  real  time  to  derive  coupled  quantum- transport,  condensate,  and  pair- 
wavefunction  equations.  In  optically-excited  semiconductors,  quantum  distribution- 
function  transport  equations  are  given  for  phonons,  plasmons,  photons,  and  electron- 
hole  pairs  and  excitons  by  transforming  the  Bethe-Salpeter  equation  into  a  multi-time 
evolution  equation.  The  virtue  of  quantum  distribution  function  is  that  it  allows  easy 
application  of  ‘device-inflow’  subsidiary  boundary  conditions  for  simulating  femtose¬ 
cond  device-switching  phenomena. 

Keywords:  Quantum  distribution  function,  quantum  transport,  ultra-fast  optics,  excitons,  non¬ 
equilibrium  superconductivity,  nonequilibrium  superfluidity,  lattice  Weyl  transform,  Liouvillian, 
super  Green’s  function 


1.  INTRODUCTION 

There  is  a  need  for  generalized  quantum  distribu¬ 
tion-function  transport  equations,  valid  for  non¬ 
normal,  non-uniform,  and  ultra-fast  systems,  as 
bases  for  large-scale  computer  simulations.  This 
becomes  urgent  with  advances  in  material  science, 
ultra-fast  laser  probes,  nanofabrication,  and  the 
development  of  more  powerful  energy  beams.  The 
drive  to  produce  systems  which  are  functionally 
more  dense  and  have  wider  bandwidths  will  lead 
nanostructure  devices  to  atomic-scale  dimensions 


with  different  materials:  insulators,  semiconduc¬ 
tors,  metals,  and  superconductors. 

The  nonequilibrium  quantum  transport  theory 
including  pairing  dynamics  is  formulated  in  terms 
of  the  Liouville-space  (L-space)  quantum-field 
theory  [1-2]  and  lattice  Weyl  transform  technique 
[3,  4].  For  normal  systems,  this  reduces  to  the 
nonequilibrium  Green’s  function  technique  of 
Schwinger  [5],  Kadanoff  and  Baym  [6],  and  Keldysh 
[7],  coupled  with  the  lattice  Weyl-Wigner  formula¬ 
tion  of  the  quantum  theory  of  solids  [3,  4].  Several 
new  results  are  derived  with  the  present  approach. 
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This  L-space  approach  has  provided  the  action 
principle  for  a  multi-variable  functional  theory  of 
nonequilibrium  condensed-matter  systems  [8-10]. 
Thus,  the  method  set  forth  here  may  open 
doors  for  the  investigation  of  ultra-fast  dynamics 
in  quantum  nanostructures.  So  far,  only  the 
distribution-function  approach  has  characterized, 
in  time-domain,  a  highly-nonlinear  and  highly- 
nonequilibrium  quantum  behavior  [11-14]. 


2.  QUANTUM  DYNAMICS  IN  LIOUVILLE 
SPACE 

The  density-matrix  equation  of  quantum  statistical 
dynamics  in  Hilbert  space  (H-space)  becomes  a 
super-Schrodinger  equation  for  the  super-state 
vector  in  L-space  as 

m^\p(t)))  =  j?\Pm.  (1) 

p(t)  is  the  density-matrix  operator  for  the  whole 
many-body  system  in  H-space,  and  |p(/)))  is  the 
corresponding  super-state  vector  in  L-space.  The 
super-operator  $£  corresponds  to  the  commutator 
[^f,p],  and  is  referred  to  in  this  paper  as  the 
Liouvillian.  Thus,  we  may  write  the  Liouvillian  if 
as  if  =  k  -  k ,  which  define  k  and  k.  These 
have  the  property  that  k\p{t)))  —  \  jf p(t)))9  and 
k\p{t)))  —  | p{t)^)).  These  relations  are  valid  for 
fermions  and  bosons.  For  number-conserving 
fermion  operator  ,  if  =  k\  The  quantum  field 
super-operators,  $$)  and$($)9  are  defined 
through  their  commutation  relations  in  H-Space. 


3.  SUPER  S-MATRIX  THEORY  IN  L-SPACE 

The  “transition  probability”  in  L-space  is  given  by 
the  following  relation 

«A(0Ip(0»  =  «l|S(oo,  -oo)|peq»  (2) 


where  |1))  is  a  unit  super-vector.  We  have 
/ 

M0»  =  T  exp  f^J  -oo)|peq)) 

0 

0 

|A(0»  =  Tac  expjy  J  if0  (/')<*'}$(*,  oo)|0» 

t 

(3) 

where  Tac  denotes  anti-chronological  time  order¬ 
ing  and 

S(t,  -oo)  =T  exp  j  ^  J  if i o(t')dt ' 

TcA^!-j{,')d‘1} 

Texpi^-^-J  &0 (t')dt' 

The  “transition  probability”  obeys  the  equality 

«l|S(oo,  -oo)|peq»  =  exp  l-W  (4) 

where  W  is  identified  as  the  effective  action.  It  can 
be  shown  that 

w= J\mKjt-&\p{t)))dt  (5) 

This  relation  forms  the  basis  of  a  time-dependent 
functional  theory  of  condensed  matter  discussed  by 
Rajagopal  and  Buot  in  a  series  of  papers  [8-10]. 


4.  GENERALIZED  QUANTUM 
DISTRIBUTION  FUNCTIONS 

We  introduce  a  4-component  second  quantized 
quantum  field  super-operators  given  by 

*(l)r  =  (</i(l)  #(1)  ^t(i)  $i))  (6) 

This  is  a  generalization  of  the  multi-component 
quantum  field  operator  first  introduced  by  Nambu 
[15],  and  by  De  Dominicis  and  Martin  [16].  In 
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terms  of  the  ^  fields,  a  system  Liouvillian  if  can  in 
general  be  expressed  as 


M 

^  =  £v(1,2A-..,W1W)*(3) 

N=  1  ‘W 

...^(AQ  +  ifext 

where  the  Schwinger  source  term  is  given  by 
^ex«  =  i%xt  -  iVt  =  «(1)®(1)+  «(12)®(1)*(2). 
The  field  super-operator  averages  can  be  written  in 
terms  of  the  5-matrix,  e.g., 


*0,2)  = 


62{{\\S{oo,-oo)\pe(i)) 
\m)  6u(2)6u(\) 

((1|5(00,  -00)|peq)) 


(8) 


We  obtained  the  following  ( e-1  for  bosons, 
e=—l  for  fermions), 


9F(1,2  )  =  ih 


n 1,2) 

eGT(  1,2) 


^1.2)) 


(9) 


where  the  superscript  T  indicates  the  taking  of  the 
transpose,  and, 


expanded  about  the  condensate  for  Wick’s  theo¬ 
rem  to  be  applicable)  to  evaluate  the  ^’s  in  terms 
of  diagrams  or  graphs,  «2f(l,2,. .  .n)  represents  the 
topologically  distinct  ‘connected’  subset  of  graphs. 
We  have 


n)  =  {ih)n 


<$”  ln((l|5(oo,  -oo)|peg)) 
6u(n)  6u(n  —  1) . 6u(l) 

(ii) 


Similar  functional  derivative  relations  can  be 
obtained  between  GQDF  with  even  number  of 
indices  by  using  the  variation  with  respect  to  the 
external  Schwinger  source  term  w(l,2).  Since 
w(  1,2)  is  an  ordinary  onumber,  the  order  of  the 
u(i,  j)' s  is  not  critical  in  taking  the  functional 
derivatives. 


4.1.  Self-Consistent  Equations  for  Generalized 
Quantum  Distribution  Functions 

We  have 


G(l,  2)  corresponds  to  the  Keldysh  nonequilibrium 
Green’s  function.  We  refer  to  the  ^(1,2)  simply  as 
moment  quantum  distribution  function.  Moments 
are  defined  for  time-ordered  quantum  field  super¬ 
operators.  We  also  define  quantum  correlation 
functions  or  quantum  cumulants,  JT,  analogous  to 
the  classical  statistical  theory.  This  distinction  is 
important  in  treating  the  quantum  transport  of 
superfluids  and  quanta  of  real  classical  fields.  We 
will  also  refer  to  both  as  generalized  quantum 
distribution  functions  (GQDF). 

In  the  application  of  Wick’s  theorem  (for 
superfluid  Bose  system  it  is  assumed  that  is 


^°(12)_1  -  S(12) 


=  *(13) 


(12) 


where  ^0(£",  2)_1  =  Hi ,  2)d/dt2~ 
,  2),  where  the  (t|) -matrix  arise  from  the 
commutation  relation  of  the  #(/,/)’ s,  if  (f”,  2)  is  a 
one-body  potential  matrix,  and  £(12)  is  the 
particle  super  self-energy  matrix.  For  fermions, 
^(1,2)  =  Jf(l,2).  £(12)  is  expressed  in  terms  of 
jT’s  (up  to  second-order  cumulants)  and  vertex 
functions.  These  vertex  functions  obey  equations 
similar  to  the  Dyson  equation,  involving  functional 
derivative  of  the  self-energy  with  respect  to  second- 
order  GQDF  hence  decoupling  the  BBKGY 
hierarchy.  The  self-energy  due  to  e-e  interaction 
includes  the  electron-plasmon  vertex  function. 


5.  QUANTUM  TRANSPORT  EQUATIONS 

The  time-evolution  equation  for  (f,  j)  is 
obtained.  We  write  the  resulting  equations  for 
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the  2x2  matrix  elements  of  c/f  (i,j)  as 


G°~'  G  =  8  +  EG  +  A  *3 

(13) 

G°~'T  GT  =  6  +  HtGT  +*  A3 

(14) 

G°“'rt3  =  S7’t3  +  etAG 

(15) 

G°  ' ^  —  eAGT  +  S3 

(16) 

where  A  is  the  pair  potential  or  gap  function,  and 
is  a  diagonal  matrix  with  elements  propor¬ 
tional  to  ih8{\2)dldt2-%\2)  with  v(12)  propor¬ 
tional  to  a  one-body  external  potential.  These 
equations  were  also  given  by  Aronov,  et  al  [17a] 
and  formally  resemble  the  well-known  Gorkov 
equations  [17b]  for  superconductors  at  thermal 
equilibrium. 

Solving  quantum  transport  problems  [11]  cen¬ 
ters  on  the  evolution  of  p<(12)  =  eih  G<(12), 
which  happens  to  be  one  of  the  matrix  elements 
of  the  nonequilibrium  matrix  Green’s  function  of 
Eq.  (9).  G>}<  contains  all  information  about  the 
statistical  aspects  of  the  field  intensity.  This  is 
coupled  to  the  ‘advanced’  and  ‘retarded’  propaga¬ 
tors  often  directly  related  to  the  experiment  and 
contain  all  the  energetics  and  dynamical  informa¬ 
tion  of  the  system.  We  obtain  the  transport 
equation  for  G>5<  given  by  the  following  expres¬ 
sions. 


5.1.  Nonequilibrium  Superconductivity 

We  have  the  following  expressions  for  super¬ 
conductors, 

K  J;+^)G>,<<12)  =  [v"+ReE''  G>,<1 

-  [Re  <7,  £>■<] 

-|{r,  g>'<}+^{a,  s>’<} 

+  { A/I Ste<  ^Sht^ee) 

+  {^hCfee  +  ShlAe/}  (17) 


K£+£)G'(i2)=[(i,+s'>'G'1 

T  [A ee  —  Ae  J  +  {  A^  gree  +  g Aee }  ^  j  g ^ 

“  i^Mifee+^hh^fe}  +  \ghh  ~  Shhl^ee  ~  \ghh 
-g<J[A>-A<] 

Equations  for  the  pair  wavefunctions,  g>,<:, 
g^K ,  grhh,  etc,  are  also  obtained.  Using  lattice 
Weyl  transformation,  we  can  transform  the  above 
total  time  evolution  equations,  Eqs.  (17) -(18) 
into  quantum  transport  equations  in  (p,  q,  E ,  t) 
phase  space  for  superconductive  systems. 

5.2.  Nonequilibrium  Bose  Superfluids 

The  expressions  for  ih(d/dt\  +  d/dt2)  G>5<(12), 
iti(d/dt\  +  d/dt2)Gr(\2),  and  pair-wavefunction 
equations,  are  similarly  obtained  by  taking  e=l 
in  Eqs.  (13)  -  (16)  and  applying  the  self-consis¬ 
tency  condition  for  the  boson-particle  self-ener¬ 
gies.  We  can  also  transform  the  total  time 
evolution  equations  into  quantum  transport  equa¬ 
tions  in  (p,  q,  E ,  t)  phase  space  for  Bose  superfluid 
systems  [18].  This  will  not  be  given  in  this  paper. 

6.  ULTRA-FAST  DYNAMICS  OF  EXCITED 
SEMICONDUCTORS 

The  physics  of  highly-excited  semiconductor  het¬ 
erostructures  has  been  of  continuing  research 
interest  [19-23].  The  electron-hole  (e-h)  pair 
theory  is  formally  identical  to  the  theory  of 
superconductivity  for  extremely  high-density  of 
e-h  pairs.  However,  in  the  low  density  limit  real 
electron -hole  pair  bound  states  do  occur,  these 
are  localized  composite  e-h  elementary  excita¬ 
tions  called  ‘excitons’  [23].  The  device  physics  [24] 
of  interacting  matter  and  radiation  requires  the 
consideration  of  all  ranges  of  e— h  densities  and 
short-time  dynamics. 

The  multi-component  (in  the  “hat”  and  “tilde” 
indices)  quantum  field  super-operators  for  electro- 
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magnetic  field  (transverse  and  longitudinal)  and 
lattice  vibrations  are  given,  respectively,  by 


We  used  a  “composite  field  operator”  as  the 
fourth  field,  ^(1,2),  since  coupling  to  fermions 
only  occur  through  bilinear  product  of  fermion 
fields.  This  is  made  up  of  #"(12),  5R(12),  e5ftr(12), 
and  ^(12),  consisting  of  different  combination  of 
the  “hat”  and  “tilde”  indices.  We  have, 

V(l,2)r=  ^(1,2)  8(1,2)  e$tT(  1,2)  ^(1,2 ;)) 

(20) 

The  time-ordered  averages  of  the  components  give 
the  familiar  quantities,  namely, 

1,2))  =  JT(1,2)=(J.^)  $,;*>) 

(21) 

The  scalar  potential  field  super-operator  0(Q  is 
not  an  independent  field  [25]. 

7.  ELECTRON  -  HOLE  AND  EXCITON  TIME 
EVOLUTION  EQUATION 

The  super  propagator  for  electron-hole  and 
exciton  is  defined  by 

•r(55;12)=^o|  <22) 

/ext(  12)  includes  the  matrix  element  of  the  dipole 
moment  containing  the  selfconsistent  transverse 
electromagnetic  field  in  the  microcavity,  for 
example.  In  the  matrix  definition  of  JT(12;  34), 
the  nonequilibrium  pair  propagator  is  identified  as 
one  of  the  matrix  elements,  namely,  the  ‘(l>4th)’- 
component. 


7.1.  Schrodinger-Wannier  Equation 
for  the  Exciton  Wavefunction 

It  is  helpful  to  derive  from  the  Bethe-Salpeter 
equation,  the  Schrodinger-Wannier  equation  for 
the  electron-hole  pair  wavefunction.  Neglecting  the 
self-energy  terms  and  retaining  only  the  Coulomb 
interaction  terms,  we  obtain  after  setting  t\  =  t2  =  t 


This  is  the  familiar  Schrodinger  equation  for  a 
two-particle  system  consisting  of  an  electron  and  a 
hole.  Using  a  dielectric  function  to  screen  the 
potential,  the  resulting  bound  states  correspond  to 
exciton  states  [23]. 


7.2.  Time  Evolution  Equations  for  JT  (12;  34) 

The  total  time  derivative  of  (12;  34)  is  obtained 
from  the  Bethe-Salpeter  equation,  the  details  will 
be  published  elsewhere.  The  ‘(l?4th)’  element 
describing  the  electron -hole  pair  propagator  is 
denoted  by  ^exciton  (12;  34),  which  in  turn  contains 
in  its  ‘(1, 4)th’  element  the  e-h  pair  density  matrix 
^citon(12;  34). 


7.3.  Transport  Equation  for  the  Nonequilibrium 
Pair  Propagator 

What  needs  to  be  done  is  to  single  out  the  equation 
for  the  electron-hole  pair  density  matrix,  which  we 
denote  by  ^XCitOn02;  34).  The  result  is  very  long 
and  will  be  published  elsewhere.  Interested  readers 
are  encouraged  to  contact  the  authors  for  this 
detail.  The  final  transport  equation  is  obtained  by 
setting  tx  =  t2,  and  /3  =  /4  and  taking  the  lattice 
Weyl  transform  of  the  above  transport  equation 
[in  general  double  lattice  Weyl  transform  in  spatial 
variables].  To  demonstrate  this,  let  us  take  the 
simplest  approximation  of  this  equation,  which  we 
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write  us 


The  lattice  Weyl  transform  can  be  cast  in  the  form 


.J  d  d 

ih[  -z—+— 


-)^xciton(12,  <1!  34,  /3) 


Qt i  1  J  ''excitonV^ 

=  f  [v(12")]^e<xciton(2"2,  4;  34,  t3) 

1  -^citon(12",  ri;  34,  4)[vr(2"2)] 
_  f  [v(33")]^<citon(12,  3"4,  /3) 

I  -^xciton(12,  4;  34",  4)[vr(4"4)] 


Upon  taking  the  lattice  Weyl  transform  and  taking 
the  gradient  expansion  the  result  is 


—  iSy(k,  k';  p" ,  4;  w,  tc) 

k  ,  p  ,  4>  CtZ,  4)  t  4,  tc) 

Similarly,  we  can  write  the  phonon  “hole” 
distribution  as 

—  iSy(k,  k'\  p" ,  lc;  u,  tc) 

=  Av(k,  k';  p",  4;  w,  tc)  [1  +  n(p",  4;  ut,  4)] 


ihoffxciion(Pu  qi;  Pii  qi\  E,  t) 
=  €  (pi)»V?1 


1 

iton (pu  q\ ;P2,  qr  e,  t) j  V5/2 


8.1.  Phonon  Transport  Equations 

We  obtain  the  following  expressions 

(  d1  d2\ _ 


+  th\  ^ pi  £  {pi)  •  v9 


^Txciton (Pi.  92;  £,  0| 

dp'^ipu  qp,p\,  qx) 

^exdton  (^1  ’  Ph  ^2  5  ^  0 

~  M  /  dp'2^p2' qr’ p'2’ 

^xdton (/»i.  q^p'n  qr,  e,  t)  (25) 

The  last  equation  is  the  appropriate  Wigner 
distribution  transport  equation  of  a  two-particle 
distribution  corresponding  to  the  Schrodinger- 
Wannier  equation  of  Eq.  (23) 


8.  TRANSPORT  EQUATIONS 
FOR  PHONONS 

The  “reduced  density  matrix”  for  phonons  is 
defined  by 

Sp(tf,;tt')  =  T(rMt,t)uJ«;',t'))  (26) 


=  Reir-tf(2),  5>'<  +  n>,<,Re5'- 


/|lm  nr,  S>'<)  -  z{  n>,<,ImS'- 


=  Ref]  —^(2))  S>,<  +  n>,<>Re5 


sr-a  = -\d{2\  s™  +  nrfl,  s 


&__e^ 

dt2  bt> 


where  we  have  used  the  definition  for  the  force 
constants 


4  m  = 


wm 


and  the  following  identities 

,T  or  s>-s<  ,T  n>-n< 

zImSr  = - - - ,  z  Im  11  = - - -  (32) 

What  then  take  the  lattice  Weyl  transform  of  Eqs. 
(29)  and  (30)  to  obtain  the  transport  equations  in 
(p,  X/c,  u>,  t)  =  (p ,  q,  E,  t)  phase-space. 
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8.2.  The  Phonon  Boltzmann  Equation 

More  revealing  equations  can  be  seen  by  neglect¬ 
ing  off-diagonal  or  “inter  branch”  terms,  and 
expanding  the  equations  in  terms  of  the  gradients. 
We  defined  a  renormalized  kinematic  frequency 
by  ft  XX  (p>  9,  E,  t)  =  ^AA2  -  9,  E,  t).  The 

Boltzmann  equation  for  the  distribution  function 
of  phonons,  nxx  ip ,  q ,  E,  t),  from  vibration  branch 
A  readily  follows  by  neglecting  leading  quantum 
corrections.  We  obtain  the  familiar  interpretation 
[22]  relating  (i/2){%n^x{p,q,E,t)/E}  as  the  scat¬ 
tering-out  rate  and  (i/2){hU^x(p,  q ,  E ,  t)/E}  as  the 
scattering-in  rate.  By  taking  the  renormalized 
frequency  to  be  given  by  the  solution  of  the 
equality  Clx x  (/?,  q,  w  t  )  =  u;,  with  this  solution 
denoted  by  u)X(p ,  t)  ,  we  finally  obtain 


n\(p,  q, oj\,  t )  +  Vphujxip,  0  •  V?«a(/>,  q, u\,  t ) 


+ 


*TIa(p»  g.^A.O 

2wa0) 


1  +«aO>^wA,0 


(33) 


The  renormalized  group  velocity  [26]  emerges  since 
phonons  do  not  diffuse  freely  but  interact  with  the 
environment  as  well  as  collide  with  each  other. 
Similar  situation  arises  in  deriving  the  plasmon  and 
photon  Boltzmann  equations.  We  expect  the 
leading  term  of  Re  J|aa  (P>  ^  0  t0  be  independent 

of  time.  An  example  of  the  RHS  of  Eq.  (33)  for 
phonon -phonon  interaction  can  be  found  in  the 
work  of  the  author  [27]  and  in  the  study  of  phonon 
hydrodynamics  and  second  sound  [28]. 


9.  TRANSPORT  EQUATIONS 
FOR  PLASMONS 


mann  equation  is  obtained  by  means  of  gradient 
expansion.  We  have 


{p,q,E,  t)  + 


'f-p- V^Re  ro»g.-M~ 

.  ^Ren  r0(p,q,E,t)  . 


*  ^ qD^  ip,q,  E,  t) 


_  »T  Uo'<(p^,E)t) 

2U|,Ren>,<?,£ 

■  if.  n^c Mi3A 

2\^WE^Uo(P>9,E,t) 


D^ip^E, 


where  we  have  left  out  terms  involving  diffusion 
in  momentum  and  energy  space.  We  would  like 
to  point  out  the  notable  use  of  the  renormali¬ 
zed  plasmon  group  velocity  given  by 
^p-VpRcU[(p,q,Ej))/fERQU:iP,9,Ej).  The 
use  of  the  renormalized  diffusion  velocity  arises 
from  the  same  reason  as  that  given  in  the 
derivation  of  the  Boltzmann  equation  for  pho¬ 
nons.  The  right-hand  side  are  the  familiar  collision 
terms.  The  omitted  terms  describe  the  kinematics 
of  the  dynamical  motion  of  plasmons. 


10.  TRANSPORT  EQUATIONS 
FOR  PHOTONS 


We  will  also  neglect  the  off-diagonal  terms  in 
polarization  indices.  The  result  to  leading  order  is 
given  by 


®  z7>,</n  p  f\  ,  ftAA  (p,q,E,t) 

■g/xx  ip,q,E,t)  + - - - 

VphCtxxip,  q,  E,  t)  •  VqF^(p,  q,  E,  t) 

+  i{  (35) 


9.1.  The  Plasmon  Boltzmann  Equation 

The  plasmon  super-propagator  is  given  by 
D0((,(f)  —  £(©(C))/£pext(C0*  The  plasmon  Boltz- 


where  ft^A0,  q,  E,  t)  =  c2p2  -  RePrxx(p,  q,  E,  t)  and 
we  have  neglected  the  leading  quantum  corrections 
on  the  right-hand  side.  We  note  that  Pr  is  related 
to  the  transverse  dielectric  tensor  [29].  We  may 
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consider  ftAA(/?,  q ,  E,t)  >  0  to  constitute  the  valid¬ 
ity  of  the  Boltzmann  equation  for  photons  with 
diffusive  term.  The  use  of  renormalized  group 
velocity  corresponds  to  the  use  of  refractive  index 
well-known  in  optics. 

A  self-consistent  Boltzmann  transport  equ¬ 
ation  follows  by  setting,  (p)q)u>,t)  =  c2p2— 
RePr(p,q,u>,  t)  =  u)2  >  0.  Upon  substituting  the 
solution  for  uj,  which  we  will  denote  by  wa  (p ,  t ), 
we  obtain 


d_ 

dt 


F\C  (a  q,  ,  t)  +  vphu)X (p,  t)  •  VqFxC ( p,q,w\,t ) 


11.  CONCLUDING  REMARKS 

A  major  challenge  in  analyzing  the .  ultra-fast 
dynamics  of  semiconductor  gain  material  for 
investigating  microcavity  lasers  is  the  fact  that 
the  electron -hole  system  properties  are  strongly 
affected  by  many-particle  Coulomb  interactions, 
and  strong  coupling  to  the  light  and  crystal  lattice- 
phonon  fields  [24].  In  device  physics  of  highly 
excited  semiconductor  systems,  involving  all 
ranges  of  e-h  pair  densities,  the  exciton,  e-h 
Cooper  pairs,  and  e-h  plasma  energetics  and  their 
accompanying  distributions  greatly  affect  the 
polaritons,  phonons,  biexcitons,  and  higher-order 
‘pairing’  (excitonic  molecules)  distributions,  their 
wavefunctions  and  energies.  For  the  bosons  of  real 
fields,  the  diffusion  velocity  is  not  equal  to  the 
group  velocity  of  bare  excitations  but  is  defined 
only  by  its  renormalized  value.  Quanta  of  these 
classical  fields  interact  with  the  environment  or 
collide  with  each  cither  as  it  diffuse  in  space.  The 
condensate  and  normal  excitation  energetics  and 
their  accompanying  distribution  also  influence  the 
gap  function  and  energy  gap.  The  full  transport 
equations  will  be  published  elsewhere. 
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In  a  companion  presentation,  we  have  discussed  the  theory  of  a  mesoscopic/ 
macroscopic  model,  which  can  be  viewed  as  an  augmented  drift-diffusion  model.  Here, 
we  describe  how  that  model  is  used.  The  device  we  consider  for  this  presentation  is  the 
one  dimensional  GaAs  n  +  —n—n  +  structure  of  length  0.8  pm.  First,  a  full  Hydro- 
Dynamic  (HD)  model,  proven  reliable  when  compared  with  Monte  Carlo  simulations,  is 
used  to  simulate  the  device  via  the  ENO  finite  difference  method.  As  applied  to  the  full 
device,  the  new  model  is  not  necessarily  superior  to  traditional  Drift-Diffusion  (DD). 
Indeed,  when  we  plot  the  quantity  rj  =  PqE/  yJkTo/m,  where  p0  is  the  mobility  constant 
and  E=  -  <j>'  is  the  electric  field,  we  verify  that  the  high  field  assumption  rj  >  1,  required 
for  the  high  field  model,  is  satisfied  only  in  an  interval  given  approximately  by  [0.2,  0.5]. 
When  we  run  both  the  DD  model  and  the  new  high  field  model  in  this  restricted  interval, 
with  boundary  conditions  of  concentration  n  and  potential  </>  provided  by  the  HD 
results,  we  demonstrate  that  the  new  model  outperforms  the  DD  model.  This  indicates 
that  the  high  field  and  DD  models  should  be  used  only  in  parts  of  the  device,  connected 
by  a  transition  kinetic  regime.  This  will  be  a  domain  decomposition  issue  involving 
interface  conditions  and  adequate  numerical  methods. 
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1  INTRODUCTION 

In  previous  work,  we  have  demonstrated  the 
robustness  of  an  algorithm  (ENO:  Essentially 
Non-Oscillatory)  designed  for  the  simulation  of 
the  hydrodynamic  model  for  semiconductors  over 


a  wide  range  of  parameters.  In  [5]  and  [6], 
n+—n~n  +  diodes  in  one  dimension  and  MES- 
FETS  in  two  dimensions  were  simulated.  The 
present  paper  deals  with  the  high  field  model 
introduced  in  [3],  and  appearing  elsewhere  in  these 
proceedings  [2]. 


*  Corresponding  author:  Tel.:  (847)  491-5575,  Fax:  (847)  491-8906,  e-mail:  jwj@math.nwu.edu. 


275 


276 


C.  CERCIGNANI  et  al 


2  THE  MODELS 

The  new  model  is  an  extension  of  the  Drift- 
Diffusion  (DD)  model.  Hence  we  first  describe  the 
DD  model  in  a  context  for  comparison. 

2.1  Drift-Diffusion  Model  in  One  Dimension 
with  \jl  Depending  On  E 

The  DD  model  is  well  documented  (see,  for 
example,  [4]).  It  is  given  by: 

nt  4~  Jx  —  0,  (2.1.1) 

where  the  representation  in  terms  of  hyperbolic 
and  viscous  components  is  given  by 

J  *^hyp  4“  Jyhi  (2.1.2) 

and 

<4yP  =  ~[mE, 

Jvis  =  —r{n0)x.  {  J 

Here,  n  denotes  carrier  concentration  and  J 
denotes  current  defined  per  unit  charge  modulus. 
We  have  separated  current  components  in  antici¬ 
pation  of  the  high  field  model  to  follow.  Also,  the 
electric  field  is  denoted  by  E ,  so  that  in  terms  of  the 
electrostatic  potential  <£, 

E=~<j>x ,  (e</>x)x  =  e(n-hd),  (2.1.4) 

where  we  use  the  customary  expressions,  for 
doping  nd,  dielectric  e,  and  charge  modulus  e .  Set 


Here,  vd  has  the  interpretation  of  the  saturation 
velocity  and  //0  is  the  low  field  mobility.  The 
velocity  for  the  DD  model  is  a  derived  quantity 
computed  by  v  =  J/n. 

The  device  we  consider  for  this  presentation  is 
the  one  dimensional  GaAs  n+ -n-n  +  structure 
of  length  0.8  pm.  The  device  used  is  as  follows: 
x  e  [0,  0.8];  the  doping  is  defined  by  nd{x)  =  105/ 
pm3  in  0  <  x  <  0.1  and  in  0.5  <  x  <  0.8,  and  by 
nd (x)  =  2  x  103  in  0.15  <  x  <  0.45,  with  a  smooth 
intermediate  transition.  The  boundary  conditions 
are:  fixed  n  at  both  ends,  and  fixed  (j)  at  both  ends 
(with  a  difference  —  v  bias).  Simulations  are  per¬ 
formed  for  vbias  =  0,  0.5  and  1.0  V,  but  results  are 
shown  for  v  bias  =1.0  only  to  save  space.  Other 
parameters:  m  =  0.065x0.9109  (10“3OKg), 
<?  =  0.1602  (10“18C), Jc^  =  0.138046xl0“4  (10"18 
J/Kelvin),  e  =  13.2x8.85418  (10“18F/pm). 

In  the  definition  of  /a,  we  take  fi0=  4.0  |am2/V/ 
ps.  We  consider  T0  —  300  K  for  which  vd=  0.6 
jam/ps,  which  is  taken  to  be  the  maximum  of  the 
velocity  in  the  HD  run  with  vbias=  1.0. 

The  results  are  shown  in  Figure  1 .  They  do  not 
match  as  well  with  those  of  the  HD  model  (with  a 
doping  nd  dependent  fi),  as  in  the  silicon  case 
which  is  not  presented  here.  The  oscillations  in  the 
velocity  near  the  left  junction  are  due  to  the 
numerical  differentiation  of  rapidly  changing 
quantities. 

2.2  The  High  Field  Model 

The  model  can  be  written  as  follows: 


e  =—T0. 

m 


(2.1.5) 


r  denotes  the  relaxation  time,  m  denotes  effective 
mass  and  ambient  temperature  is  denoted  by  T0, 
with  0  given  in  energy  units.  Of  course,  /x  is 
mobility,  and  we  take  to  be  dependent  upon  the 
electric  field  E ,  using  the  formula  (2.42)  in  [4]: 


nt+Jx  =  Q,  (2.2.1) 

where  the  representation  in  terms  of  hyperbolic 
and  viscous  components  is  now  given  by 

J  —  </hyp  4“  J\i s>  (2.2.2) 

and 
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FIGURE  1  Comparison  of  the  results  of  the  HD  model  with  a  doping  nd  dependent  fi  (solid  line)  and  that  of  the  DD  model  with  an 
electric  field  E  dependent  p  (dashed  line).  GaAs,  T0=  300,  vbias  =  1.0.  Top  left:  concentration  n  in  pm"3;  top  right:  the  electric  field 
E  -  -<j)x  in  volts/pm;  bottom  left:  the  potential  (j>  in  volts;  bottom  right:  the  velocity  v  in  pm/ps. 


Also,  the  electrostatic  equation  is  satisfied  as  in 
the  DD  model,  and  r  and  6  are  defined  similarly. 
Furthermore,  w  is  taken  to  be  a  constant: 

u  =  (finE)\x=0,  (2.2.4) 

and  the  velocity  is  again  derived  quantity. 

We  again  consider  the  same  GaAs  device  as 
before.  The  results  are  shown  in  Figure  2.  Not 
much  improvement,  if  any,  is  observed  over  the 
DD  results.  The  spikes  in  the  velocity  near  the 
junctions  are  due  to  the  derived  nature  of  that 
quantity;  particularly  the  numerical  differentiation 
of  rapidly  changing  quantities. 

3  RESTRICTION  TO  CHANNEL 

We  plot  the  quantity  rj  =  hqE/  y/(kTo/m)  where 
/xo  is  the  mobility  constant  and  E  ~  —<$>'  is  the 


electric  field  (obtained  from  the  HD  simulation), 
in  Figure  3. 

We  can  clearly  see  from  Figure  3  that  the  high 
field  assumption  r?  >  1  is  satisfied  only  in  an 
interval  given  approximately  by  [0.2,  0.5].  We  thus 
run  both  the  DD  model  and  the  new  high  field 
model  in  this  restricted  interval,  with  boundary 
conditions  of  concentration  n  and  potential  (f> 
provided  by  the  HD  results.  We  can  see  in  Figure 
4  that  the  new  model  decisively  outperforms  the 
DD  model. 

This  indicates  that  the  high  field  and  DD 
models  should  be  used  only  in  parts  of  the 
device,  connected  by  a  transition  kinetic  regime. 
This  will  be  a  domain  decomposition  issue 
involving  interface  conditions  and  adequate 
numerical  methods,  and  is  currently  under 
investigation. 
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FIGURE  2  Comparison  of  the  results  of  the  HD  model  (solid  line)  and  that  of  the  new  model  (dashed  line).  GaAs,  T0  =  300,  vbias 
=  1.0.  Top  left:  concentration  n  in  pm-3;  top  right:  the  electric  field  E—  —<j)x  in  volts/pm;  bottom  left:  the  potential  0  in  volts;  bottom 
right:  the  velocity  v  in  pm/ps. 


FIGURE  3  The  quantity  rj  —  fioE/ ^(kTo/m),  where  E  =  —  0' 
is  the  electric  field  obtained  from  the  HD  simulation. 


4  COMPARISON  WITH  KINETIC  AND 
AUGMENTED  DRIFT-DIFFUSION 
MODELS 

The  high  field  model  may  be  thought  of  as  a  form 
of  an  augmented  DD  model.  In  this  section,  we 
briefly  compare  the  results  of  our  simulations  with 
those  of  [1]  and  [7].  The  results  of  [7]  deal 
primarily  with  tracking  the  carrier  drift  velocity  in 
Silicon.  The  model  is  very  similar  to  that  originally 
employed  by  Thornber  [9].  We  note  that  the 
perturbation  terms  of  the  high  field  model 
introduced  here  include  more  than  simply  differ¬ 
entiated  electric  field  terms.  Comparisons  are 
made  in  [7]  among  various  models,  including 
DD  and  augmented  DD,  as  well  as  the  hydro- 
dynamic  model  and  Monte-Carlo  simulation.  In 
these  studies,  the  authors  of  [7]  tended  to  find  that 
standard  DD  understated  drift  velocity,  even  in 
GaAs.  In  our  own  comparisons,  when  using 
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FIGURE  4  The  comparison  of  the  Drift-Diffusion  (DD)  model  (dashed  line),  the  new  high  field  model  (dotted  line)  and  the 
HydroDynamic  (HD)  model  (solid  line),  in  the  sub-interval  [0.2, 0.5]  using  the  HD  results  as  boundary  conditions.  GaAs  7o-  300 

vbias  =  1.0.  Top  left:  concentration  n  in  pm  3;  top  right:  the  electric  field  E - <j>x  in  volts/pm,  bottom  left,  the  potential  <j>  in  vo  , 

bottom  right:  the  velocity  v  in  pm/ps. 


derived  velocity,  and  a  field  dependent  mobility  in 
DD,  we  did  not  experience  such  a  pronounced 
understatement.  We  have  presented  the  results  of 
the  simulations  in  Figure  1  for  comparison  with 
standard  DD,  and  in  Figure  2  for  comparison 
with  the  high  field  model.  And,  remarkably,  we 
find  that  in  the  channel,  as  a  problem  posed  only 
there,  DD  overstates  the  derived  velocity!  More¬ 
over,  the  high  field  model  predicts  a  velocity 
intermediate  between  the  DD  model  and  the 
hydrodynamic  model. 

Paper  [1]  is  difficult  to  correlate  directly  because 
emphasis  is  placed  upon  frequency  distributions. 
Nonetheless  the  average  velocity  and  electric  field 


calculations  made  there  compare  favorably  with 
our  own.  Issues  of  heating  and  cooling  discussed 
there  are  not  raised  in  the  present  paper,  but  we 
intend  to  consider  the  effect  of  energetics  in  future 
work.  The  model  has  already  been  developed  by 
the  second  author. 


5  ALGORITHM 

We  shall  only  briefly  describe  the  algorithm  used 
in  this  paper,  namely  the  ENO  scheme  developed 
in  [8].  The  ENO  scheme  is  designed  for  a  system  of 
hyperbolic  conservation  laws  of  the  form, 
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ut  =/(«)*  =g(u,x,t),  (5.1) 

where  u  =  («i,. . um)T,  and  the  hyperbolicity 
condition, 


df  . 

—  is  diagonalizable,  with  real  eigenvalues, 


holds.  An  initial  condition  is  adjoined  to  (5.1). 

For  systems  of  conservation  laws,  local  field  by 
field  decomposition  is  used,  to  resolve  waves  in 
different  characteristic  directions.  For  this  pur¬ 
pose,  analytical  expressions  are  needed  for  the 
eigen-values  and  eigenvectors  of  the  Jacobian 
matrix.  This  reduces  the  determination  of  the 
scheme  to  the  case  of  a  single  conservation  law. 
Thus,  to  describe  the  schemes,  consider  the  scalar 
one  dimensional  problem,  and  a  conservative 
approximation  of  the  spatial  operator  given  by 

=  (5-2) 

Here,  the  numerical  flux /is  assumed  consistent: 
fj+i  =Kuj-\ » •  •  • , «/+*);  ■•■,«)  =fW).  (5.3) 


The  conservative  scheme  (5.2),  which  charac¬ 
terizes  the/ divided  difference  as  an  approximation 
to  /(«)„  suggests  that  /  can  be  identified  with  an 
appropriate  function  h  satisfying 


/(«(*))  =  [  (5.4) 

Jx-tf 

If  H  is  any  primitive  of  h,  then  h  can  be 
computed  from  H'.-H  itself  can  be  approximated 
by  polynomial  interpolations  using  Newton’s 
divided  difference  method,  beginning  with  differ¬ 
ences  of  order  one,  since  the  constant  term  is 
arbitrary.  The  necessary  divided  differences  of  H, 
of  a  given  order,  are  expressed  as  constant 
multiples  of  those  of  /  of  order  one  lower.  The 
main  ingredient  of  the  ENO  method  is  the 
adaptive  choice  of  stencil:  it  begins  with  a  starting 
point  to  the  left  or  right  of  the  current  “cell”  by 
means  of  upwinding,  as  determined  by  the  sign  of 


the  derivative  of  a  selected  flux  (or  the  eigenvalue 
of  the  Jacobian  in  the  system  case);  as  the  order  of 
the  divided  differences  is  increased,  the  divided 
differences  themselves  determine  the  stencil:  the 
“smaller”  divided  difference  is  chosen  from  two 
possible  choices  at  each  stage,  ensuring  a  smooth¬ 
est  fit. 
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Simulation  of  Bistable  Laser  Diodes 
with  Inhomogeneous  Excitation 
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A  comprehensive  time-dependent  1-D  computer  model  is  developed  for  the  simulation 
of  multi-section  bistable  laser  diodes.  Analysis  of  a  Fabry-Perot  (FP)  cavity  laser  diode 
indicates  that  the  height  and  shape  of  the  optical  input  pulse  as  well  as  the  wavelength 
play  important  roles  in  the  set/reset  operation  of  optical  switching. 

Keywords:  Transient  simulation,  bistable  laser  diodes,  optical  switching,  set/reset  operation, 
undershoot  switching 


1.  INTRODUCTION 

In  recent  years,  BiS table  Laser  Diodes  (BSLDs) 
have  attracted  much  attention  because  of  their 
potential  applications  in  optical  switching  and 
wavelength  conversion.  The  simulation  of  BSLDs 
involves  large-signal  analysis,  which  currently 
existing  LD  simulators  (such  as  CLADISS  [1]) 
cannot  perform.  Most  studies  of  BSLDs  are 
conducted  using  a  set  of  rate  equations  that  are 
expressed  as  a  function  of  time  only  [2],  Such  an 
approach  has  two  main  limitations.  First,  an 
approximation  of  distributed  loss  is  required. 
Second,  the  spatial  dependence  of  the  gain 
saturation  inside  the  cavity  is  not  taken  into 
account.  Most  existing  models  use  a  single 


variable  to  represent  the  total  photon  density 
without  considering  the  difference  in  wavelength. 
Moreover,  these  models  often  do  not  include  all 
pertinent  physical  processes  which  compete  with 
each  other  to  influence  the  dynamic  character¬ 
istics  of  the  device.  Therefore  a  more  sophisti¬ 
cated  simulator  is  needed  to  assist  the  design  as 
well  as  the  understanding  of  the  physics  of 
BSLDs. 

In  this  work,  a  comprehensive  time-dependent 
one-dimensional  computer  model  is  developed  for 
the  simulation  of  multi-section  bistable  laser 
diodes.  Longitudinal  spatial  hole-burning,  non¬ 
linear  gain,  carrier  induced  refractive  index  change 
(frequency  chirp),  and  thermal  effects  are  rigor¬ 
ously  accounted  for. 
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2.  PHYSICAL  MODEL 


A  typical  longitudinal  view  of  a  multi-section  laser 
diode  is  shown  in  Figure  1.  The  laser  consists  of 
two  gain  sections  and  a  saturable  absorption 
section  in  between. 

The  optical  field  E(z ,  t)  can  be  separated  into  the 
forward  (  + )  and  backward  (-)  propagating  parts, 
satisfying  the  traveling-wave  rate  equations, 


,  dE ±  1  dE ± 


dz 


dt 


(2  7T  .1 

=  (a^  +  I2< 


G)Ed 


(1) 


where 


G( A,  w(z))  =  I>(A,  n(z))  -  Taac  -  (1  -  I'Ki, 

«r(A,«(z))  =  nefr  +  rAnr(A,«(z)), 

and  T  is  the  confinement  factor,  «eff  is  the  effective 
refractive  index  determined  by  waveguide  struc¬ 
ture,  g  is  the  gain  function,  aac  and  ac\  represent 
the  absorption  coefficients  in  the  active  layer  and 
cladding  layers,  respectively,  and  A nr  is  the  carrier 
induced  change  of  refractive  index. 

If  we  assume  E±(z,  t)  =  E±(z,  t)ei<f>±^9t\  after 
substituting  into  equation  (1)  and  separating  the 
real  and  imaginary  parts,  we  obtain 


For  each  longitudinal  mode  (Aw),  the  amplitude 
and  phase  satisfy  the  following  boundary  condi¬ 
tions  at  z  =  0  and  z  =  L: 

E+(0,t)  =  rlE-(0,t),  (4) 

0  =  0m(°>  t)  +  2nm,  (5) 

E-(L,t)  =  r2E+(L,t),  (6) 

4>-(Et)  =  <i>+(L,t),  (7) 

where  m  is  an  integer,  r:  and  r2  are  reflective 
coefficients  of  the  front  and  rear  facets,  respec¬ 
tively.  However,  for  an  incident  optical  field  (at 
z  =  0)  of  an  arbitrary  wavelength  (Ain),  since  the 
incident  optical  field  t\Eex{t)  interferes  with  the 
reflected  field  ri£_(0,  t)  just  inside  the  left  (front) 
facet,  the  boundary  condition  at  z  =  0  becomes 

E+(0,t)  =  rlE7(0,t)  +  tlEn(t),  (8) 

and  at  z  =  L,  it  is  given  by  equations  (6)  and  (7). 
Under  the  steady-state  condition,  equation  (3)  can 
be  rewritten  as 

*-(0,f)-*+(0,f)  =  A*t  (9) 


where 


8E±  1  dE± 

dz  dt 


X-G{  A,»(z))£±, 


(2) 


±d^+vX=2inMz))-  (3) 
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FIGURE  1  Longitudinal  structure  of  a  three-section  laser 
diode. 


■jf 


2 n&L  +  2 T  /  Anr(X,  n(z))dz 


(10) 


is  the  round-trip  phase  change.  For  the  light  in  the 
mth  mode,  A<l>w  must  be  equal  to  Imn.  Thus  we 
can  calculate  wavelength  change  (frequency  chirp) 
by  solving  this  nonlinear  equation.  For  the 
incident  light,  since  its  wavelength  is  determined 
by  the  external  light  source,  A3>in  will  change  with 
the  carrier  density  in  the  active  region.  Since  the 
round-trip  time  (2L/vg~  10 ps)  is  typically  much 
shorter  than  the  response  time  (>0.2^)  for 
devices  under  the  study,  we  can  still  use  the 
boundary  condition  (8)  for  the  time-dependent 
problems,  instead  of  solving  the  rate  equations  for 
the  phases. 

If  we  define  Pm  ( Qm )  as  the  photon  density  in  mth 
mode  traveling  in  the  +z  (— z)  direction, 
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Pm  =  Pm(z,t)  =  l-\E+{z)t)\2lhum,  (11) 

V8 

Qm  =  Qm{z,t)=  —  \Em{z,t)\2/nuJm,  (12) 
V8 

and  take  into  consideration  the  spontaneous 
emission,  the  traveling-wave  rate  equations  for 
the  photon  density  become, 

dPm  ,  dPm ^  ^  D 

~~df  ^  Vg  ~dz  ~~  Vg  '  Gnv1)  ’ 

+  2^sp  *  Psp(n)i  (^3) 

dQm  9Qm  _  n  ( ^  n 
~dt  Vg  ~dT  ~  Vg  *  Q m 

+  \&P-R*p(n),  (14) 

and  the  corresponding  boundary  conditions  are, 
Pm(0,t)  =  RiQm(0,t),  (15) 

Qm{L,t)  =  R1Pm{L,t),  (16) 

where  R\  =  |ri|2and  R2  =  \r2\2- 

For  the  incident  light  (Pin,  gin),  the  rate 
equations  remain  the  same,  while  the  boundary 
conditions  are  different,  given  by 

\Mn(0,  t)  cos(</>q" )  =  y/RioiJ&J)  COS (fa) 

+  v/r,pex(i),(0,  (17) 


ri  of  the  injected  carrier  is  captured  by  the  active 
layer  where  the  carriers  distribute  uniformly  in  the 
lateral  and  transverse  directions.  No  doping  is 
assumed  and  hence  it  follows  from  the  charge 
neutrality  condition  that  the  hole  density  equals 
the  electron  density.  The  rate  equation  for  carrier 
density  n(z,  t )  in  the  active  layer  is  given  by 


dn(z,  t) 
dt 


_  _  JL  _  Rsp(n)  -  CAug«3 

qd  Tnr 

“  vgT  *  |  8m(n)(Pm  +  Qm) 


+  ^in(«)(i,in  +  ein)|.  (21) 


In  our  model,  the  linear  gain  is  approximated  by 

{gA-(n~  n0),  n  <  ni0 

g2  ■  n1  +  gx  ■  n  +  g0)  «20  >  n  >  «io 

gG-{n-n0).  n>n2o 

where  gA  and  gG  are  the  differential  gain  coeffi¬ 
cients  in  the  gain  and  loss  region,  h0  is  the 
transparency  carrier  density.  These  parameters 
can  be  determined  by  theoretical  calculation  or 
experiment.  It  should  be  emphasized  that  different 
values  of  gA,  gG,  and  n0  are  used  for  each 
longitudinal  mode  and  optical  input. 

The  stimulated  emission  becomes  nonlinear 
whenever  large  optical  intensities  exist  in  the  active 
layer.  To  account  for  this  effect  a  simplified  gain 
model  is  used,  i.e., 


y/PjoJ)  sin (fa)  =  v^iCMM  sin(<£0 ),  (18) 

fa  ~  fa  ~  A^in(w)j  (19) 

Qin(L,t)  =  R2Pin(L,t),  (20) 

where  fa(t)  =  faa(0,  t),  fa (t)  =  <^(0,  t),  T\  =  \tx\2 , 
and  Pn(t)  =  (l/vg)  E2fat)/hu>in. 

We  assume  that  all  sections  are  current  con¬ 
trolled  and  that,  due  to  the  leakage,  only  a  fraction 


gNL(\n) 


1  ~£nl^(P  +  Q) 

m,  in 


•£l(A,  «), 


(22) 


where  eNL  is  the  material  gain  compression  factor. 
Then  the  carrier-induced  change  of  refractive 
index  is  assumed  to  be  given  by 

Anr(X,n)  =  -aR (23) 

where  aR  is  assumed  to  be  a  constant. 
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3.  NUMERICAL  SOLUTION  METHODS 

In  this  work,  we  have  improved  the  spatial 
discretization  scheme  described  in  [3],  A  first-order 
backward  finite  difference  formula  is  applied  to 
P(z,  t)  and  a  forward  difference  applied  to  Q(z ,  t). 
The  photon  density  is  defined  at  the  nodes  while 
the  carrier  density  is  defined  between  two  adjacent 
nodes.  The  change  in  the  carrier  density  in  a  region 
now  depends  on  the  average  number  of  photons 
surrounding  that  region.  A  trapezoidal  rule/back¬ 
ward-differentiation-formula  composite  method 

[4]  has  been  implemented  for  the  time  integration. 
The  system  of  discretized  nonlinear  equations  is 
solved  simultaneously  using  the  full  Newton 
method.  The  simulation  result  shown  in  Figure  4 
can  be  obtained  within  20  minutes  (CPU  time)  on 
a  DECstation  5000/200. 

4.  SIMULATION  RESULTS 

A  two-electrode  FP  cavity  InGaAs/InP  laser  diode 

[5]  has  been  simulated.  An  example  of  set-reset 
operation  (undershoot  switching)  using  a  single 
wavelength  (Ain=  1557.6  nm)  light  is  shown  in 
Figure  2.  The  bias  current  is  chosen  to  be  7j  = 
18.4  mA.  The  intensities  of  pulses  for  set  and  reset 
are  0.874mW  and  13.12mW,  respectively. 

The  mechanism  of  undershoot  switching  is 
believed  to  be  dependent  on  dilferent  changing 
rates  of  carrier  densities  in  the  gain  and  absorption 
regions  under  a  variation  of  the  optical  excitation. 
We  have  found  that  the  shape  of  the  pulses  plays 
an  important  role  in  undershoot  switching,  as 
shown  in  Figures  3  and  4. 

From  our  simulation  results,  a  subnanosecond 
switch-on  (rise)  time  is  observed,  while  the  switch- 
off  (fall)  time  of  the  device  is  about  3  ns.  We  also 
found  that  an  input  pulse  with  a  higher  intensity 
and  faster  fall  time  would  reduce  the  switch-off 
time  somewhat  but  still  no  less  than  2  ns. 

The  wavelength  of  the  injected  optical  pulses  is 
also  crucial  for  the  undershoot  switching.  The 
minimum  set/reset  powers  will  be  modulated  by 
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FIGURE  2  The  set-reset  operation  (undershoot  shooting): 
optical  input  and  optical  output. 


FIGURE  3  Influence  of  the  shape  (rise/fall  time)  of  the  input 
optical  pulses. 


the  resonant  characteristic  of  the  Fabry-Perot 
cavity  as  shown  in  Figure  5.  The  two  curves  are 
almost  parallel.  Both  have  minima  near  the 
resonant  wavelength. 
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FIGURE  4  Examples  of  the  set-reset  operation:  the  effects  of 
pulse  shape  on  undershooting  for  Wm=  13.12  mW,  Ain  = 

1 557.6  nm. 
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FIGURE  5  Influence  of  the  wavelength  of  the  optical  signals 
on  the  set-reset  operation. 

5.  CONCLUSION 

In  this  paper,  a  comprehensive  transient  1-D 
computer  model  for  the  simulation  of  multi¬ 


section  bistable  laser  diodes  is  presented.  Traveling 
wave  rate  equations  are  solved  for  each  long¬ 
itudinal  mode  and  optical  input.  All  the  important 
effects  have  been  included  in  this  model. 

Analyses  of  two-electrode  FP  cavity  InGaAsP/ 
InP  laser  diodes  incorporating  an  absorber  region 
are  carried  out  to  illustrate  the  capability  of  the 
simulator.  Special  attention  is  paid  to  the  optical 
set-reset  operation  (undershoot  switching).  Both 
the  height  and  shape  (rise/fall  time)  as  well  as 
the  wavelength  of  the  optical  input  pulse  are 
shown  to  play  an  important  roles  in  undershoot 
switching. 
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Intersubband  Relaxation  in  Step  Quantum 
Well  Structures 
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Intersubband  relaxation  due  to  electron  interactions  with  the  localized  phonon  modes 
plays  an  important  role  for  population  inversion  in  quantum  well  laser  structures 
designed  for  intersubband  lasers  operating  at  mid-infrared  to  submillimeter  wave¬ 
lengths.  In  this  work,  intersubband  relaxation  rates  between  subbands  in  step  quantum 
well  structures  are  evaluated  numerically  using  Fermi’s  golden  rule,  in  which  the 
localized  phonon  modes  including  the  asymmetric  interface  modes,  symmetric  interface 
modes,  and  confined  phonon  modes  and  the  electron -phonon  interaction  Hamilto¬ 
nians  are  derived  based  on  the  macroscopic  dielectric  continuum  model,  whereas  the 
electron  wave  functions  are  obtained  by  solving  the  Schro dinger  equation  for  the 
heterostructures  under  investigation.  The  sum  rule  for  the  relationship  between  the  form 
factors  of  the  various  localized  phonon  modes  and  the  bulk  phonon  modes  is  examined 
and  verified  for  these  structures.  The  intersubband  relaxation  rates  due  to  electron 
scattering  by  the  asymmetric  interface  phonons,  symmetric  interface  phonons,  and 
confined  phonons  are  calculated  and  compared  with  the  relaxation  rates  calculated 
using  the  bulk  phonon  modes  and  the  Frohlich  interaction  Hamiltonian  for  step 
quantum  well  structures  with  subband  separations  of  36meV  and  50meV,  correspon¬ 
ding  to  the  bulk  longitudinal  optical  phonon  energy  and  interface  phonon  energy, 
respectively.  Our  results  show  that  for  preferential  electron  relaxation  in  intersubband 
laser  structures,  the  effects  of  the  localized  phonon  modes,  especially  the  interface 
phonon  modes,  must  be  included  for  optimal  design  of  these  structures. 


Keywords:  Localized  phonons,  relaxation  rates,  intersubband  laser 


1  INTRODUCTION 

In  recent  years,  novel  tunneling  injection  lasers 
using  structures  based  on  semiconductor  quantum 


wells  have  provided  the  means  for  achieving  lasing 
wavelengths  from  the  mid-infrared  to  the  sub¬ 
millimeter  wave  region  [1,2,3].  These  lasers 
incorporate  narrow  quantum  well  regions  that 
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must  have  thicknesses  as  small  as  30-50  A  and 
have  transition  level  separations  engineered  to  one 
unit  or  a  multiple  of  bulk  Longitudinal-Optical 
(LO)  phonon  energy.  It  is  now  well  known  that  the 
shape  and  energies  of  LO  phonon  modes  are 
modified  by  quantum  wells.  Specifically,  LO 
phonons  in  quantum  wells  may  be  described  by 
the  dielectric  continuum  model  in  terms  of 
confined  and  interface  modes  that  have  properties 
different  from  those  of  bulk  phonons  [4, 5, 6].  In 
this  paper,  the  interface  and  confined  LO  phonon 
modes  of  the  continuum  model  [7]  are  used  with 
Fermi’s  golden  rule  to  calculate  relaxation  rates 
between  quasi  bound  states  in  step  quantum  well 
structures.  We  show  that  the  electron  relaxation 
rates  are  dominated  by  the  interface  phonon 
scattering,  and  that  the  symmetries  of  phonon 
modes  induce  inherent  selection  rules  of  transi¬ 
tions  among  well  levels.  In  addition,  the  sum  rule 
for  the  form  factors  [6]  is  verified  numerically  for 
the  step  quantum  well  structures. 


2  INTERFACE  PHONON  ASSISTED 
TRANSITION 

The  well  structure  in  this  study  is  shown  in  Figure 
1,  denoting  the  innermost  GaAs  to  be  layer  1,  the 
next  Alo.25Gao.75As  layer  2  and  the  outermost 
Alo.6Gao.4As  layer  3.  There  are  six  types  of  phonon 
modes  due  to  the  double  interfaces  between  layers 
1  and  2:  (1)  symmetric  interface  modes,  (2)  anti¬ 
symmetric  interface  modes,  (3)  confined  LO 
modes,  (4)  confined  TO  modes,  and  (5)  half-space 
LO  modes.  With  the  eigenmodes  given  by  [6],  we 
can  obtain  the  electron-phonon  interaction  from 
the  Frohlich  continuum  model 

He-p  =  e  jd2r  J  dz  p(r,z)4>(r,z),  (1) 


FIGURE  1  Well  potential  and  wavefunctions  of  the  first  two 
levels. 


only  the  antisymmetric  modes  may  contribute 
significantly  to  electron  transition  rates  between 
these  two  levels.  For  the  antisymmetric  phonon 
modes, 
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where  q  is  the  two  dimensional  phonon  wavevector 
parallel  to  the  heterostructure  interface, 
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where  p(r,  z)  is  the  generalized  electron  density 
operator  and  <j)( r,  z)  the  electrostatic  potential 
created  by  the  LO  phonons.  Since  wavefunc¬ 
tions  of  the  first  two  levels  have  opposite  parities, 


where  uTn  and  uLn  are  the  frequencies  of  the 
longitudinal  optical  (LO)  phonon  and  the  trans¬ 
verse-optical  (TO)  phonon,  respectively.  uja±  are 
the  antisymmetric  mode  frequencies  and  are  given 
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by  solutions  of  the  following  equation, 

ei  (a/)  coth  Q  Qaj  +  e2M  =  0.  (4) 

For  long  wavelengths  they  approach  ujlx  and 
fA  is  defined  as 

{_e9(z+^)?  z  <~\a 

sinh(gz)/ sinh(^a),  -\a  <z  <\a 

z  >\a 

(5) 

The  electron-phonon  interaction  induces  elec¬ 
tron  transition  between  the  two  well  levels.  Using 
Fermi’s  golden  rule,  the  rates  of  transitions 
assisted  by  various  modes  of  the  interface  phonon 
and  bulk  phonon  are  calculated  as  functions  of 
electron  in-plane  energy.  In  Figure  2,  numerical 
results  of  the  antisymmetric  and  bulk  phonon 
assisted  transitions  are  given  at  a  well  level  energy 
separation  of  E2-Ex  =  47  meV,  which  shows  that 
the  antisymmetric  interface  phonon  assisted  tran¬ 
sition  dominates  at  a  level  separation  close  to  the 
antisymmetric  phonon  frequency.  Contributions 
from  the  other  phonon  modes  are  generally 
negligible. 

The  maximum  transition  rates  occur  when  the 
electron  in-plane  energy  compensates  the  differ¬ 
ence  between  the  mode  frequency  and  energy  level 
separation.  Figure  3  presents  a  comparison  of  the 
maximum  transition  rates  of  the  bulk  mode  and 
antisymmetric  mode  at  various  values  of  E2  —  Ex, 
which  are  obtained  by  varying  the  quantum  well 
width.  A  device  structure  which  is  engineered  with 
an  energy  level  separation  of  the  antisymmetric 
interface  mode  frequency  would  have  a  relaxation 
rate  several  times  larger  than  that  with  a  bulk 
mode  frequency  separation,  as  shown  in  Figure  3. 

For  the  structure  under  study,  the  upper 
antisymmetric  mode  plays  the  most  important 
role  in  the  transition  between  the  energy  level  2 
and  level  1.  For  other  transitions  (e.g.  between 
level  3  and  level  1)  the  symmetric  modes  may 
become  important.  The  selection  rule  is  due  to  well 


FIGURE  2  Transition  rates  of  antisymmetric  and  bulk 
modes. 


defined  parities  of  the  antisymmetric  and  sym¬ 
metric  modes.  In  most  cases,  it  is  also  applicable  to 
non-symmetric  well  structures  to  a  certain  degree, 
but  the  specific  phonon  modes  differ  in  different 
device  structures  and  will  have  to  be  derived  for  a 
specific  structure. 
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3  SUM  RULE  OF  FORM  FACTORS 


The  form  factors  [6]  can  be  defined  through 
effective  electron -electron  interaction  due  to 
exchange  of  optical  phonons.  The  form  factors 
are  independent  of  material  parameters  but 
depend  on  the  spatial  shape  of  the  phonon  modes 
and  subband  electron  wavefunctions.  In  this 
double  heterostructure,  the  form  factors  for  a 
subband  and  different  phonon  modes  are  given  by 

1  T  f°°  1 2 

'^-i+ssasl  . 

•j  r  p  oo  "1 2 

rM = r;coth(i9„)  [Ldz M-rf/Av). , 

a  n- 

Fc{q)  =  J_dz  fjmrfW)? 


l+coth(^a)  L7-o 


-g\z-/\  1  cosh(gz)cosh(^z/) 

1  +  tanh  {\qa)  cosh  2{\qa) 

1  sinh(^z)sinh(^z/)" 
l-fcoth(^a)  sinh2(f^a) 


poo  poo 

.{q)=Ja_  dzja  dz,\il>(z)\2\ip(z')\2 


(q)=  f  dz  j  <fe'|V’(z)|2|i 

J  —OO  J  — oo 

[g-?|z-2/|  _  Jg-9|z+z'+o|  j  _ 


FH-{q)  = 


where  /s  is  given  by 


fs{qz)=l  cosh(?z)/cosh(f^), 


z  <  —\a 
—\a  <  z  <  \a 
z  >\a 


These  form  factors  always  satisfy  the  sum  rule  [6] 

FB{q)  =  Fs{q)  +  FA  (q)  +  Fc(q ) 

+  FH+(q)  +  FH-(q), 

which  is  a  direct  consequence  of  the  complete 
orthonormality  of  the  interface  eigenmodes.  Ver¬ 


ifying  the  sum  rule  provides  a  justification  of  our 
numerical  calculations.  The  sum  rule  between  the 
various  form  factors  has  been  verified  for  different 
well  levels  of  the  step  quantum  well  structure.  As 
an  example,  in  Figure  4,  the  calculated  form 
factors  for  level  2  are  plotted.  The  sum  of  form 
factors  of  the  symmetric  mode  and  half  space 
modes  is  indeed  equal  to  that  of  the  bulk  mode. 
The  form  factors  for  the  confined  LO  mode  and 
antisymmetric  mode  in  this  case  are  negligible 
compared  to  those  for  the  symmetric  and  half 
space  (H+  and  H-)  modes,  although  this  is  not 
always  true  for  other  levels. 

4  CONCLUSION 

By  examining  and  comparing  electron  transition 
rates  induced  by  bulk  and  interface  phonon 
modes,  our  calculations  for  step  quantum  well 
structures  designed  for  intersubband  lasers  have 
shown  that,  to  facilitate  preferential  electron 
relaxation,  it  is  important  to  consider  the  transi¬ 
tion  induced  by  the  interface  phonon  modes  and 
have  electron  subbands  engineered  with  their 
separation  close  to  the  interface  phonon  energy 
in  the  device  structures. 


inplane  wavevector  (1/m) 


FIGURE  4  Sum  rule  of  form  factors. 
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Resonances  in  Conductance  Through  Tunable  Attractors 

YONG  S.  JOE*  and  RONALD  M.  COSBY 

Department  of  Physics  and  Astronomy,  Ball  State  University,  Muncie,  IN  47306,  USA 


We  present  the  effects  of  resonant  tunneling  through  doubly-coupled  attractors  in  series 
in  a  quantum  nanosystem.  It  is  found  that  multiple  split-tunneling  peaks  arise  from 
resonant  tunneling  through  multiple  nondegenerate  quasi-bound-states  of  the  coupled 
attractors.  We  show  that  the  tunability  of  coupling  is  achieved  by  modulating  the 
strength  of  the  attractor  for  a  fixed  separation  length  between  the  attractors,  and  by 
adjusting  the  geometrical  separation  length  for  a  fixed  intervening  barrier  strength. 


Keywords:  Nanostructures,  Semiconductors,  Tunneling 


1.  INTRODUCTION 

Current  state-of-the-art  techniques  result  in  sur¬ 
face-independent  probes  or  contacts  that  may  be 
biased  to  introduce  tunable  scattering  potentials 
(“impurities”)  in  quantum  nanosystems  [1].  Many 
theoretical  and  experimental  works  then  have 
studied  the  elastic  scattering  by  impurities  in  these 
systems.  In  particular,  for  a  quantum  structure 
containing  a  finite-size  attractive  impurity,  called 
an  ‘attractor’,  it  is  shown  that  an  attractor  intro¬ 
duces  multiple  quasi-bound-states  (QBSs)  in  the 
channel  for  a  sufficiently  strong  attractor,  and 
these  states  give  rise  to  multiple  resonant  peaks 
before  the  first  plateau  in  the  conductance  [2]. 
These  conductance  peaks  appear  due  to  resonant 
tunneling  through  the  QBSs. 


In  this  paper,  we  study  the  splitting  of  multiple 
resonant  peaks  before  the  first  plateau  in  the 
conductance  for  a  quantum  nanosystem  with  two 
identical  attractors  in  series.  The  separation  of  the 
split-conductance  peaks  through  both  the  first  and 
the  second  QBSs  depends  strongly  on  the  coupling 
between  the  two  attractors,  which  can  be  tuned  by 
modulating  the  intervening  barrier  strength  [3] 
and/or  changing  the  geometrical  separation 
length. 

2.  MODEL  AND  METHOD 

A  sketch  of  the  model  two-dimensional  nanosys¬ 
tem  with  two  identical  attractors  in  series  is 
illustrated  in  Figure  1(a),  where  relevant  para- 
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FIGURE  1  (a)  Quantum  structure  studied:  Two  identical 

finite-size  attractors  are  coupled  with  the  coupling  length  d 
along  the  transport  direction,  (b)  A  schematic  potential  profile 
of  double  attractors  along  the  channel,  where  two-nondegene- 
rate  sets  of  QBSs  are  shown  in  a  potential  well. 


meters  are  defined.  We  present  results  for  the  case 
where  WfWa  =  2,  WJLa=l,  and  the  attractors 
are  centered  in  the  channel.  We  note  that  similar 
types  of  the  structure  modeled  here,  using  several 
different  techniques,  have  been  realized  experi¬ 
mentally  [4],  For  instance,  Geim  et  al  [5]  studied 
resonant  tunneling  through  “donor  pair-mole¬ 
cules”  with  random  separation  in  quantum  de¬ 
vices.  Figure  1(b)  illustrates  a  schematic  potential 
profile  of  double  attractors  along  the  channel, 
where  two-nondegenerate  sets  of  QBSs  are  shown 
in  a  potential  well 

Our  calculations  of  the  conductance  are  based 
on  a  nearest-neighbor  tight-binding  (TB)  Hamil¬ 
tonian  and  a  recursive  Green’s  function  technique. 
Electrons  in  this  system  are  free  to  move  in  the  x- 
direction  but  are  confined  in  the  j-direction,  as 
described  by  the  Schrodinger  equation  {—ti2/2m 


Vl+Vc(y)-En}il>n(y)=  0,  where  Vc(y)  =  oo 
for  the  regions  defined  by  hard  walls  [shown  as 
cross-shaded  regions  in  Figure  1(a)],  and 
Ve(y)  =  —  Fatt  in  the  region  with  the  attractors 
for  \2y\  <  Wa  [shown  as  shaded  regions],  and 
otherwise  Vc(y)  =  0. 

By  discretizing  the  system  spatially  with  lattice 
constant  a ,  the  resulting  TB  Hamiltonian  is  exactly 
solved  by  using  the  recursive  Green’s  function 
method  [6].  The  Green’s  function  G  of  a  system  in 
the  presence  of  a  perturbation  V  is  related  to  the 
Green’s  function  G°  in  the  absence  of  V.  The 
necessary  recursion  relations  are  obtained  by 
taking  the  appropriate  matrix  elements  of  Dyson’s 
equation,  G  =  G°  +  G°VG.  The  conductance  is 
then  obtained  using  the  Landauer  equation  [7],  G 
=  (2 e2/h)  |fMl/|2  =  (2e2/h)  Tr  (ttf ),  where  t  is  the 

transmission  matrix. 


3.  RESULTS  AND  DISCUSSIONS 

We  investigate  conductance  peak  splitting  due  to 
resonant  tunneling  through  multiple  QBSs  in  a 
quantum  system  with  two  attractors  in  series. 
Figure  2  shows  the  conductance  G  of  the  structure 
as  a  function  of  the  Fermi  energy  for  different 
attractor  strengths.  (Here,  the  Fermi  energy  is 
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FIGURE  2  Splitting  of  conductance  peaks  before  the  first 
plateau  as  a  function  of  the  Fermi  energy  for  different  attractor 
strengths  Vati  and  a  constant  coupling  length. 
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normalized  with  respect  to  the  first  subband  in  the 
channel  W,  Ex=h27i2/2mW 2).  These  curves  are 
vertically  offset  one  with  respect  to  the  other  by 
one  conductance  unit  for  clarity.  It  is  clearly  seen 
that  resonant  tunneling  peaks  [two  for  Fatt 
(=  y/Vaxt/Ei)  in  the  range  from  -2.55  to  -2.70 
and  four  for  Fatt  from  —2.75  to  —2.80]  appear 
before  the  first  plateau  (EF/EX  —  1)  in  the  con¬ 
ductance.  The  appearance  of  these  peaks  is  a 
consequence  of  both  the  formation  of  multiple 
QBSs  in  the  attractor  [2]  [when  an  incident  energy 
of  the  electron  becomes  the  resonant  energy 
the  longitudinal  momenta  pHiX  are  enhanced 
because  the  QBSs  Ef1  have  negative  values 
(pn  x  =  y/2m(Ef*  -  £*“))],  and  the  overlap  of 
these  QBSs  due  to  the  coupling  between  the  two 
attractors  [3]. 

First,  we  note  that  the  resonance  widths  of  each 
peak  through  Ef t  and  Efx  are  very  different;  a 
qualitative  explanation  follows.  While  EF  varies 
from  0  to  Eu  all  subbands  Em  in  the  narrow 
channel  without  an  impurity  are  evanescent  modes. 
The  mode  coupling  between  Em  and  Ef1  plays  an 
important  role  in  determining  the  interaction 
matrix  element,  which  is  a  vital  factor  in  the 
calculation  of  the  transmission  coefficient  and  the 
conductance.  Intermode  couplings  between  Em  and 
Efx  for  (m,n)  combinations  with  even  and  odd 
modes  yield  negligible  contributions  because  the 
modes  have  opposite  parities.  The  resonant  tunnel¬ 
ing  through  E  ftt  is  mainly  due  to  the  coupling 
between  Ex  and  E  ftt.  Similarly,  the  resonant  peak 
through  Ef1  is  due  to  the  coupling  between  E2  and 
Ef1.  However,  since  the  evanescent  mode  E2  is 
always  energetically  further  away  from  the  Fermi 
energy  than  Ex,  the  coupling  between  E2  and  Efx  is 
weaker  than  that  between  Ex  and  Ef*.  Therefore, 
when  an  electron  moves  from  E2  to  the  Ef1  state,  it 
stays  a  longer  time  in  Ef1  and  barely  tunnels 
through  it  because  of  weak  coupling.  This  indicates 
that  the  mean  lifetime  is  larger  and  the  width  of  the 
tunneling  peak  is  narrower. 

Second,  we  discuss  the  splitting  of  tunneling 
peaks  in  the  presence  of  coupling  between 
attractors.  For  two  identical  series  attractors,  the 


single  resonant  peak  through  the  first  QBS  of  Ef 
splits  into  two  well-separated  peaks,  shown  in 
Figure  2  in  the  lower  range  of  the  Fermi  energy  for 
most  values  of  Fatt  displayed.  [Here,  the  first  half 
of  the  split  peaks  is  not  shown  for  Fatt  =  -2.78 
and  -2.80].  In  addition  to  these  peaks,  a 
conductance  peak  through  the  second  QBS  of 
E  f  also  splits  into  two  sharp  spikes  for  Fatt  = 
-2.75,  —2.78  and  —2.80.  A  more  careful  examina¬ 
tion  shows  that  each  tunneling  peak  through  Ef 
has  a  Lorentzian  shape  centered  around  the 
resonance  energy  [2].  We  attribute  this  peak 
splitting  to  the  elimination  of  the  energy  degen¬ 
eracy  due  to  the  coupling  between  two  attractors. 
In  other  words,  if  the  separation  between  identical 
attractors  is  small  enough,  then  the  aligned, 
uncoupled  QBSs  in  each  attractor  overlap  and 
produce  new  nondegenerate  states  of  the  coupled 
attractor  system.  For  a  simple  two-attractor 
model,  one  coupled  state  is  slightly  higher  (£**j) 
and  the  other  slightly  below  (is*^)  the  uncoupled 
energy  (Ef1)  [see  Fig.  1(b)],  with  a  perfect 
tunneling  rate  indicated  (i.e.,  transmittance 
through  these  states  reaches  one).  Therefore, 
multiple  split-conductance  peaks  arise  from  the 
resonant  tunneling  through  multiple  nondegene¬ 
rate  QBSs  of  the  coupled  attractors. 

Next,  we  discuss  the  effect  of  the  coupling 
between  the  attractors  on  conductance  peak 
splitting.  It  is  expected  that  the  separation  of  the 
split-tunneling  peaks  in  conductance  is  reduced,  as 
the  coupling  between  two  attractors  becomes 
weaker.  This  result  is  observed  in  the  double  and 
triple  quantum  dots,  where  a  coupling  (i.e.,  tunnel 
barrier)  between  the  dots  is  controlled  by  separate 
gates  [8].  Here,  we  show  that  the  tunability  of 
coupling  can  also  be  achieved  by  modulating  (1) 
the  strength  of  the  attractor  Vatt  for  a  fixed 
separation  length  between  the  attractors,  and  (2) 
the  coupling  length  d  (i.e.,  geometrical  separation) 
for  a  fixed  intervening  barrier  strength. 

As  the  attractor  depth  |Fatt|  increases,  in  Figure 
2,  the  separation  of  the  split  peaks  through  both 
the  first  and  the  second  QBS  becomes  smaller.  This 
is  clearly  seen  in  Figure  3,  where  the  energy 
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FIGURE  3  The  separation  of  peak  splitting  Asps  as  a 
function  of  |Katt|  is  plotted,  for  a  fixed  coupling  length  dj 
Wa  =  1 .0,  indicating  that  weak  coupling  between  the  attractors 
can  be  achieved  by  increasing  Katt  resulting  in  strong  localized 
QBSs  in  the  attractor. 

separation  of  peak  splitting,  Asps,  as  a  function  of 
|  Fan  |  is  plotted.  Here,  Asps  through  both  non¬ 
degenerate  states  Efl  (triangles)  and  Efl  (squares; 
multiplied  by  a  factor  of  10  for  clarity)  decreases, 
albeit  with  different  slopes.  This  can  be  explained 
by  the  fact  that  for  a  constant  geometrical 
coupling  between  the  attractors,  a  larger  |Fatt| 
makes  the  QBSs  lower  in  energy  and  more 
localized  in  the  attractor,  and  therefore  results  in 
less  interactions  or  overlaps  between  them.  As  a 
consequence,  the  separation  of  peak  splitting  in 
conductance  decreases  as  |FaU|  increases.  We 
expect  that  the  peak  splitting  will  physically 
disappear  when  |Fatt|  is  large  enough  to  produce 
strong  localized  QBSs  in  each  attractor,  i.e.,  for 
an  extremely  weak  coupling  case. 

Figure  4  shows  the  calculated  conductance  as  a 
function  of  the  Fermi  energy  for  various  coupling 
lengths.  In  order  to  see  the  effect  of  the  geometrical 
coupling  between  the  attractors  on  tunneling  peak 
splitting,  we  focus  on  doubly  split-tunneling  peaks 
through  the  first  two-nondegenerate  QBSs 
and  is  f*j)  for  a  fixed  Fatt  =  -2.75.  The  geometrical 
separation  length  varies  from  d/Wa  —  0.5  (bottom 
curve)  to  d/Wa  =  3.0  (top  curve)  in  steps  of  0.1,  and 
the  origin  of  the  curve  for  different  coupling  length 
is  shifted  vertically  in  order  to  make  a  clear 


FIGURE  4  The  doubly  split-conductance  peaks  through  the 
first  two-nondegenerate  QBSs  for  different  coupling  length.  The 
geometrical  separation  length  varies  from  d/Wa=  0.5  (bottom 
curve)  to  d/Wa  =  3.0  (top  curve)  in  steps  of  0.1,  and  the  curves 
are  vertically  shifted  for  clarity. 

comparison.  As  d/Wa  increases  (i.e.,  the  attractors 
are  weakly  coupled),  the  level  separation  A Ex 
(—  Ef^  —  isfj)  becomes  narrower  and  hence  the 
separation  of  the  split-conductance  peaks  through 
these  states  is  reduced  in  Figure  4.  It  is  expected 
that  when  the  attractors  are  far  away  from  each 
other,  each  one  has  its  original  QBS  Efl,  and  these 
degenerate  states  give  rise  to  a  single  resonant  peak 
before  the  first  plateau.  Notice  that  as  djWa 
increases,  the  width  of  the  peak  through  Ef^ 
becomes  narrower  because  the  QBS  is  is  more 
bound  and  localized  around  the  attractor.  As 
expected,  this  feature  can  be  seen  in  the  opposite 
way  through  the  QBS 

Figure  5  shows  how  the  separation  of  the  split 
peaks  changes  with  increasing  coupling  length 
d/Wa.  It  appears  that  the  energy  separation  of  peak 
splitting,  Asps,  decreases  exponentially  as  coupling 
length  djWa  increases  for  Fatt  =  —2.75.  Therefore, 
for  the  split-conductance  peaks  through  both  2sftt 
and  E |u,  the  coupling  length  d,  in  general,  is  related 
to  Asps  as  dtt  —W  In  (Asps)  for  different  Fatt. 

4.  CONCLUSIONS 

In  summary,  we  have  studied  the  effects  of 
resonant  tunneling  through  doubly-coupled  at- 
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FIGURE  5  A  relationship  between  the  separation  of  the  split 
peaks  Asps  and  coupling  length  d/Wa ,  showing  that  Asps 
decreases  exponentially  as  d/Wa  increases  for  Fatt  =  —2.75. 


tractors  in  series  in  a  quantum  nanosystem.  We 
have  found  that  the  doubly  split-conductance 
peaks  arise  from  the  resonant  tunneling  through 
two  nondegenerate  QBSs  of  the  coupled  attrac¬ 
tors,  and  the  separation  of  the  split-tunneling 
peaks  are  strongly  affected  by  the  coupling  length 
and  the  intervening  barrier  strength. 
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Convergence  Properties  of  the  Bi-CGSTAB  Method  for  the 
Solution  of  the  3D  Poisson  and  3D  Electron  Current 
Continuity  Equations  for  Scaled  Si  MOSFETs 
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As  semiconductor  technology  continues  to  evolve,  numerical  modeling  of  semiconduc¬ 
tor  devices  becomes  an  indispensible  tool  for  the  prediction  of  device  characteristics. 

The  simple  drift-diffusion  model  is  still  widely  used,  especially  in  the  study  of  subthres¬ 
hold  behavior  in  MOSFETs.  The  numerical  solution  of  these  two  equations  offers 
difficulties  in  small  devices  and  special  methods  are  required  for  the  case  when  dealing 
with  3D  problems  that  demand  large  CPU  times.  In  this  work  we  investigate  the 
convergence  properties  of  the  Bi-CGSTAB  method.  We  find  that  this  method  shows 
superior  convergence  properties  when  compared  to  more  commonly  used  ILU  and  SIP 
methods. 


Keywords:  Semiconductor  devices,  device  modeling,  Bi-CGSTAB  method,  discrete  impurities, 
subthreshold  conduction 


1.  INTRODUCTION 

The  demand  of  having  more  transistors  on  a  single 
chip  continues  to  push  MOSFETs  toward  shorter 
channel  lengths.  In  order  to  maintain  the  long- 
channel  behavior  and  suppress  undesirable  short- 
channel  effects,  channel  doping  levels  must  increase, 
whereas  oxide  thickness,  junction  depths  and 
isolation  spacing  decrease.  To  make  final  threshold 
voltage  adjustments  and  to  eliminate  the  punch- 
through  effect,  ion  implantation  is  used  at  several 


steps  in  the  fabrication  process  of  these  state-of- 
the-art  devices.  The  stochastic  nature  of  ion 
implantation  across  the  wafer  leads  to  significant 
deviation  in  both  the  number  of  dopants  and  the 
arrangement  of  these  dopant  atoms  from  device  to 
device  [1,2].  While  a  continuum  doping  model 
may  be  a  valid  assumption  for  large  devices,  it  fails 
in  the  case  of  deep-submicrometer  devices.  For 
example,  in  MOSFETs  with  effective  channel 
lengths  and  widths  on  the  order  of  0.1pm  and 
channel  doping  of  5xl017cm"3,  the  number  of 
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impurity  atoms  in  the  inversion  layer  is  less  than 
100.  Thus,  differences  in  the  arrangement  of 
dopant  atoms,  as  well  as  fluctuations  in  the  number 
of  dopants  under  the  gate,  lead  to  significant 
variation  of  the  off-state  leakage  current  in  these 
state-of-the-art  devices. 

The  high  doping  levels  used  in  submicrometer 
devices  lead  to  large  gradients  in  the  potential  and 
charge  within  the  device,  which  requires  a  very  fine 
mesh.  This  makes  some  of  the  commonly  used 
iterative  methods  for  the  solution  of  both  3D 
Poisson  and  3D  electron  current  continuity 
equations  rather  inefficient  because  of  poor  con¬ 
vergence  properties.  To  overcome  this,  we  use  a 
variant  of  the  Conjugate  Gradient  Squared  meth¬ 
od  (CGS),  the  so-called  Bi-CGS  Stabilized  (Bi- 
CGSTAB)  method  [3].  The  convergence  properties 
of  this  method  are  found  to  be  superior  to  those  of 
the  Incomplete  Lower-Upper  (ILU)  factorization 
method  [4].  A  variant  of  the  ILU  method,  Stone’s 
[5]  Strongly  Implicit  Procedure  (SIP)  was  also 
investigated,  but  found  to  be  comparable  to  ILU. 
Finally,  we  discuss  our  3D  simulations  of  the 
subthreshold  characteristics  of  0.1  pm  gate-length 
MOSFETs  with  various  gate-widths. 

2.  APPROACH 

The  geometrical  and  structural  parameters  of  the 
device  used  in  our  numerical  analysis  are  illu¬ 
strated  in  Figure  1.  The  depth  of  the  discrete 
doping  region  is  0.14  pm  which,  for  substrate 
doping  A^  =  5xl017cm-3  and  oxide  thickness 
tox  =  3  nm,  corresponds  to  more  than  twice  the 
depth  of  the  depletion  region.  The  3D  Poisson  and 
electron  current  continuity  equation  are  solved 
assuming  Fermi-Dirac  (degenerate)  statistics  with 
Dn  and  pn  constant. 

The  boundary  conditions  for  cp,  n  and  p  are  as 
follows:  Contact  regions  are  assumed  to  be  charge 
neutral  so  the  electrostatic  potential  ip  and  the 
carrier  concentrations  n  and  p  at  the  contacts  are 
prescribed  by  the  usual  Dirichlet  conditions.  At 
the  free  surfaces  and  artificial  boundaries,  the 


gate 


discrete  doping  region 

FIGURE  1  Schematic  view  of  the  device  structure.  The  box 
underneath  the  gate  shows  the  spatial  extension  of  the  discrete 
doping  region. 

perpendicular  field  and  the  perpendicular  current 
are  set  to  zero.  We  use  the  decoupled  scheme  [6]  in 
which,  the  linearized  Poisson  equation  is  repeat¬ 
edly  solved  for  the  electrostatic  potential  <p.  The 
resulting  values  of  ip  are  then  used  to  obtain  the 
improved  values  for  n  and  /?.  This  loop  corre¬ 
sponds  to  one  Gummel  cycle.  The  above  process  is 
repeated  until  self-consistent  values  of  p,  n  and  p 
are  obtained. 

The  application  of  finite-difference  techniques  to 
both  3D  Poisson  and  3D  electron  current  con¬ 
tinuity  equations  leads  to  algebraic  equations 
having  a  well-defined  structure,  represented  by 
Ax  =  b.  The  most  suitable  methods  for  solution  of 
these  two  equations  are  direct  methods,  but  the 
computational  cost  becomes  prohibitive  as  the 
number  of  equations  increases.  This  has  led  to 
iterative  procedures  that  utilize  the  structure  of  the 
coefficient  matrix.  The  ILU  methods,  and  the  use 
of  preconditioning  together  with  conjugate  gradi¬ 
ents,  provide  a  significant  increase  in  the  power  of 
iterative  methods. 

Within  incomplete  factorization  schemes  [4],  the 
original  matrix  A  is  decomposed  into  a  product  of 
lower  and  upper  triangular  matrices  L  and  U 
respectively.  This  is  achieved  by  modifying  matrix 
A  through  the  addition  of  a  small  matrix  N.  Thus, 
one  solves  the  modified  system  LUx=b+Nx  by 
solving  successively  the  matrix  equations  LV^ 
b  +  Nx  and  V  =  Ux,  where  V  is  an  auxiliary  matrix. 
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The  superfluous  terms  of  N  affect  the  convergence 
of  the  ILU  method.  Stone  [5]  suggested  the 
introduction  of  partial  cancellation  which  mini¬ 
mizes  the  influence  of  these  additional  terms  and 
accelerates  the  rate  of  convergence.  This  partial 
cancellation  procedure  is  implicitly  included  in  the 
calculation  of  L  and  U.  To  provide  rapid 
convergence,  the  fraction  that  is  canceled  is  varied 
on  successive  pairs  of  iterations. 

The  basic  CG  algorithm  loses  its  applicability 
when  the  resulting  system  of  equations  is  not  SPD, 
as  it  is  the  case  with  the  electron  current  continuity 
equation.  In  such  circumstances,  the  best  alter¬ 
natives  are  Lanczos-type  algorithms.  In  recent 
years,  the  CGS  method  due  to  Sonneveld  [7]  has 
been  recognized  as  an  attractive  transpose-free 
variant  of  the  Bi-Conjugate  Gradient  (Bi-CG) 
iterative  method.  The  Bi-CGSTAB  method  due  to 
Van  der  Vorst  [3]  is  a  variant  of  the  CGS  algorithm 
which  avoids  squaring  of  the  residual  polynomial. 
The  convergence  behavior  of  this  method  is 
smoother  because  it  produces  more  accurate 
residual  vectors  and,  therefore,  more  accurate 
solutions.  To  reduce  the  spectral  conditioning 
number  and  improve  the  distribution  of  the 
smallest  eigenvalues,  one  applies  preconditioning 
techniques  together  with  conjugate  gradient 
solvers.  Successful  preconditioning  matrix  can  be 
obtained  by  using  ILU  factorization  [8]:  If  L  and 
U  are  the  strictly-lower  and  the  strictly-upper 
triangular  parts  of  A,  then  the  preconditioning 
matrix  is 

Kilu  =  (L-bD)D-1(U  +  D),  (1) 

where  diag(KILU)  =  diag(A).  Once  the  diagonal  D 
is  computed,  one  scales  the  original  matrix  A 

A  =  D-WAD-W  =  diag(A)  +  L  +  U.  (2) 

The  preconditioning  matrix  for  this  symmetrically 
scaled  matrix  is  of  the  form 

K=(L  +  I)(I  +  U),  (3) 


where  I  is  the  identity  matrix.  The  Bi-CGSTAB 
method  is  now  applied  to  the  preconditioned 
system 

(L  +  I)-1A(I  +  U)-1x  =  (L  +  I)-1b,  (4) 

where  b=D_1/,2b  and  x=D-1/,2(I  +  U)_1x.  In  the 
calculation  of  the  product  (L  +  I)_1A(I  +  U)_1p, 
significant  amount  of  extra  work  is  avoided  by 
using  Eisenstat’s  trick  [9]. 

In  [1],  the  implantation  process  was  simulated  in 
the  following  way:  n  impurity  atoms  were  placed  in 
a  region  Vtot  which  was  8000  times  larger  than  the 
discrete  doping  region  Fdisc.  Only  those  atoms  (k) 
that  fell  within  the  discrete  doping  region  were 
retained.  In  this  way,  one  generates  a  binomial 
distribution  [10],  which  in  the  limit  n  =  VaFtot— 

P  -  Fdisc /  Ftot— >0  and  «/>->  a  (where  a  =  Na  Fdisc 
is  the  mean  value  of  the  process),  approaches  a 
Poisson  distribution 

(5) 

This  observation  suggests  that  one  can  mimic 
ion  implantation  by  drawing  a  random  number  k 
from  the  Poisson  distribution  itself  and  uniformly 
distribute  these  k  impurity  atoms  within  the 
discrete  doping  region  of  the  device  by  using 
triplets  of  independent  uniformly  distributed  ran¬ 
dom  numbers.  This  simplification  is  essential  when 
dealing  with  devices  with  wider  gates  and  very  high 
substrate  doping. 

3.  RESULTS  AND  CONCLUSIONS 

The  test  device  used  for  our  convergence  studies 
has  0.1  pm  gate  length  and  0.2  pm  gate  width.  The 
nonuniform  finite-difference  grid  used  in  these 
simulations  has  55x40x55  grid  points  along  the  x 
(channel  length),  y  (depth)  and  z  (width)  axis.  The 
predetermined  threshold  for  self-consistency  for 
the  electrostatic  potential  is  10~5  thermal  voltages. 

In  Figure  2,  we  compare  the  convergence 
properties  of  various  Poisson’s  equation  solvers 
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Iteration  Number 

FIGURE  2  Convergence  properties  of  the  3D  Poisson 
equation  solvers. 

for  Fs=  FD  =  KbS—  0 V  and  FG=1V.  From  the 
results  shown,  it  follows  that  preconditioned  Bi- 
CGSTAB  method  is  more  economical  than  the 
ILU  and  SIP  methods.  It  gains  very  little  when 
using  more  than  two  internal  loops.  The  use  of  the 
ILU  preconditioner  reduced  the  number  of  itera¬ 
tions  by  approximately  a  factor  of  two.  The  slope 
of  these  curves  is  another  indication  of  the 
performance  of  each  solver. 

The  convergence  behavior  of  our  continuity 
equation  solvers  is  shown  in  Figure  3.  The  applied 
bias  used  in  these  simulations  is  Fs  —  FBs  =  0  V, 
FD=10  mV  and  FG=1  V.  Again  ,  the  precondi¬ 
tioned  Bi-CGSTAB  method  shows  superior  con¬ 


FIGURE  3  Convergence  properties  of  the  3D  electron 
current  continuity  equation  solvers. 


vergence  behavior.  Gummel’s  decoupling  scheme 
converged  in  21  (13)  cycles  when  using  Bi-CGS¬ 
TAB  (ILU)  method.  However,  the  computational 
work  within  each  Gummel  cycle  for  the  ILU 
method  is  more  than  200  iterations,  so  that  the 
total  simulation  time  for  achieving  self-consistency 
using  this  method  is  significantly  longer. 

We  also  studied  the  width  dependence  of  the 
threshold  voltage  Fth  in  scaled  Si  MOSFETs. 
Here,  we  use  0.1  pm  gate-length  and  0.05,  0.1,  0.15 
and  0.2  pm  gate-widths.  Statistical  averaging  of  the 
results  obtained  over  25  different  devices  was 
performed.  The  doping  profile  of  each  device  is 
evaluated  using  our  simplified  procedure.  Using 
the  bisection  method,  we  determine  the  value  of 
the  gate  voltage  VG=  Vth  for  which  the  drain 
current  equals  /D  =  /0ff=  10“10  A.  A  set  of  20 
transfer  characteristics  for  devices  with  Wg  =  02 
pm  which  have  different  number  and  distribution 
of  impurity  atoms  under  the  gate  is  shown  in 
Figure  4.  The  continuum  doping  model  results  are 
also  shown  in  this  figure.  For  the  devices  with 
discrete  doping  distribution  under  the  gate,  the 
subthreshold  slope  does  not  vary  significantly 
from  device  to  device,  and  is  78.06±1.75mV/ 
decade.  Even  though  the  gate- width  of  these 
devices  is  4  times  larger  than  that  used  in  [1],  we 
still  observe  a  significant  spread  of  the  transfer 


Gate  Voltage  VG  [V] 

FIGURE  4  MOSFETs  subthreshold  transfer  characteristics. 
In  the  inset  we  show  the  width  dependence  of  the  threshold 
voltage  for  subthreshold  conduction. 
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characteristics  along  the  gate  axis  due  to  the 
nonuniformity  of  the  potential  barrier  which 
allows  for  early  turn-on  of  some  parts  of  the 
channel.  This  observation  is  supported  by  the  fact 
that  the  averaged  threshold  voltage  is  always 
shifted  about  10  mV  to  lower  voltages.  As  shown 
in  the  inset  of  Figure  4,  this  threshold  voltage  shift 
is  essentially  independent  of  the  width  of  the  gate. 

In  conclusion,  for  solving  very  large  systems  of 
algebraic  equations  derived  from  3D  problems, 
iterative  methods  remain  the  only  viable  solution 
technique.  The  greatly  improved  convergence  rates 
achieved  by  preconditioned  conjugate  gradient 
methods  means  that  the  solution  of  such  problems 
is  more  attainable  now  than  ever  before.  For  both 
3D  Poisson  and  3D  electron  current  continuity 
equations,  the  Bi-CGSTAB  method  showed  rela¬ 
tively  smooth  and  superior  convergence  behavior 
when  compared  to  the  ILU  methods. 

Acknowledgments 

The  authors  would  like  to  thank  C.  Ringhofer,  C. 
Gardner  and  H.  Mittelman,  for  valuable  discus¬ 
sions  and  comments.  This  work  was  supported  by 
the  Office  of  Naval  Research. 


References 

[1]  Wong,  H.-S.  and  Taur,  Y.  (1993).  “Three-Dimensional 
Atomictic  Simulation  of  Discrete  Random  Dopant  Dis¬ 
tribution  Effects  in  Sub-0.1  pm  MOSFET’s”,  IEDM 
Technical  Digest ,  705-708. 

[2]  Zhou  ,  J.-R.  and  Ferry,  D.  K.  (1995).  “3D  Simulation  of 
Deep-Submicron  Devices”,  IEEE  Computational  Science 
and  Engineering ,  2,  30-37,  summer. 

[3]  Van  der  Vorst,  H.  A.  (1992).  “Bi-CGSTAB:  A  Fast  and 
Smoothly  Convergent  Variant  of  Bi-CG  for  the  Solution 
of  Nonsymmetric  Linear  Systems”,  SIAM  J.  Sci.  Stat. 
Compute,  13,  631-644. 

[4]  Gadiyak,  G.  V.  and  Obrecht,  M.  S.  (1986).  “The  Use  of 
Factorization  Methods  for  Solving  the  Charge  Transfer 
Equations  in  Semiconductor  Devices”,  Simulation  of 
Semiconductor  Devices  and  Processes :  Proceedings  of  the 
Second  International  Conference ,  147-156,  Ed.  by.  K. 
Board  and  D.  R.  J.  Owen,  Swansea  UK:  Pineridge  Press. 


[5]  Stone,  H.  L.  (1968).  “Iterative  Solution  of  Implicit 
Approximations  of  Multidimensional  Partial  Differential 
Equations”,  SIAM  J.  Numer.  Anal ,  5,  530-558. 

[6]  Gummel,  H.  K.  (1964).  “A  Self-Consistent  Iterative 
Scheme  for  One-Dimensional  Steady  State  Transistor 
Calculations”,  IEEE  Trans.  Electron  Devices ,  ED-11, 
455-465. 

[7]  Sonneveld,  P.  “CGS,  a  Fast  Lanczos-Type  Solver  for 
Nonsymmetric  Linear  Systems”,  SIAM  J.  Sci.  Stat.  Corn- 
put .,  10,  36-52,  January  1989. 

[8]  Van  der  Vorst,  H.  A.  (1989).  “High  .Performance 
Preconditioning”,  SIAM  J.  Sci.  Stat.  Compute  10, 
1174-1185. 

[9]  Eisenstat,  S.  C.  (1981).  “Efficient  Implementation  of  a 
Class  of  Preconditioned  Conjugate  Gradient  Methods”, 
SIAM  J .  Sci.  Stat.  Comput.,  2,  1-4. 

[10]  Picinbono,  B.  (1993).  Random  Signals  and  Systems , 
Englewood  Cliffs,  New  Jersey:  Prentice  Hall. 


Authors’  Biographies 

Dragica  Vasileska  is  Assistant  Professor  at 
Arizona  State  University.  She  received  her  Ph.D. 
degree  in  Electrical  Engineering  from  Arizona 
State  University  in  1995.  Her  research  interests 
include  semiconductor  device  physics,  3D  device 
simulations  and  transport  theory. 

William  J.  Gross  received  the  B.S.  degree  in 
microelectronic  engineering  from  R.I.T.  in  1990 
and  the  M.S.  degree  in  electrical  engineering  from 
Arizona  State  University  in  1994.  He  is  presently 
pursing  his  Ph.D.  in  electrical  engineering  at  ASU 
in  the  simulation  of  deep-submicrometer  MOS- 
FETs. 

Venceslav  Kafedziski  is  currently  working  to¬ 
ward  his  Ph.D.  degree  at  Arizona  State  University. 
His  research  interests  include  digital  communica¬ 
tions,  the  applications  of  information  and  estima¬ 
tion  theory  and  digital  signal  processing  to  wireless 
communications. 

David  K.  Ferry  is  Regents’  Professor  of  Engine¬ 
ering  at  Arizona  State  University.  He  is  a  Fellow 
of  both  the  American  Physical  Society  and  the 
Institute  of  Electrical  and  Electronics  Engineers. 
He  pursues  the  physics  and  modeling  of  nano- 
structured  devices. 


VLSI  DESIGN 

1998,  Vol.  8,  Nos.  (1-4),  pp.  307-312 
Reprints  available  directly  from  the  publisher 
Photocopying  permitted  by  license  only 


©  1998  OPA  (Overseas  Publishers  Association)  N.V. 

Published  by  license  under 
the  Gordon  and  Breach  Science 
Publishers  imprint. 
Printed  in  India. 


Wave  Function  Scarring  Effects  in  Open  Ballistic 

Quantum  Cavities 
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Numerical  simulations  have  been  carried  out  of  the  magneto-transport  and 
corresponding  wave  functions  of  two  quantum  dot  structures,  namely  the  stadium 
and  the  Sinai  billiard.  In  our  simulations,  the  Schrddinger  equation  is  mapped  onto  a 
tight-binding  lattice  by  replacing  derivatives  by  finite  differences,  and  the  conductance  is 
calculated  via  the  Landauer  formula  following  the  application  of  an  iterative  technique 
to  translate  across  the  structure.  In  both  structures,  many  of  the  resonance  features  in 
the  transport  show  scarring,  that  is,  the  amplitude  of  the  corresponding  wave  functions 
is  highly  concentrated  along  underlying  periodic  classical  orbits.  Our  analysis  indicates 
that  certain  periodicities  evident  in  the  magneto-transport  can  be  associated  with 
particular  orbits. 


Keywords:  Transport,  numerical  simulation,  quantum  simulation,  chaos,  periodic  orbits,  finite 
differences 


1.  INTRODUCTION 

Semiconductor  billiards  have  recently  attracted 
considerable  interest  as  a  novel  probe  of  transport 
and  quantum  chaos.  The  devices  examined  typi¬ 
cally  consist  of  a  central  mesoscopic  cavity, 
connected  to  external  reservoirs  by  a  pair  of 
quantum  point  contacts  through  which  current  is 
measured.  The  lithographic  size  of  these  dots  is 
usually  made  much  smaller  than  the  mean  free 
path,  so  that  transport  is  ballistic.  While  it  has 
been  suggested  that  multiple  billiard  scattering 


from  cavity  walls  will  induce  chaotic  behavior  [1], 
recent  studies  of  rectangular  billiards  have  estab¬ 
lished  the  basic  regular  nature  of  the  orbits  [2],  In 
stadium  billiards,  it  is  still  thought  that  chaotic 
behavior  is  induced,  even  in  open  structures  [3,  4]. 
However,  recent  experiments  on  stadium  shaped 
dots  suggest  that  their  electrical  properties  are 
dominated  by  a  characteristic  periodicity  in  their 
magnetoconductance  [3,  5].  In  the  earlier  study,  it 
was  tentatively  suggested  that  this  periodicity 
arises  from  the  remnants  of  regular,  semi-classical 
orbits  within  the  dots  [3].  In  order  to  investigate 
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this  effect,  we  have  modeled  the  dots  by  solving  the 
quantum  mechanical  problem  on  a  discrete  lattice 
using  a  numerically  stabilized  variant  of  the 
transfer  matrix  approach  [6].  The  details  of  this 
method  are  summarized  in  the  next  section.  The 
section  following  presents  our  theoretical  results 
for  the  stadium  as  well  as  a  comparison  with 
experiment.  Simulating  dots  of  comparable  size, 
we  reproduce  the  periodicity  of  the  fluctuations 
found  experimentally.  Moreover,  we  find  that  the 
wave  functions  of  these  structures  can  be  heavily 
scarred  by  periodic  orbits,  with  certain  scars 
recurring  periodically  in  magnetic  field,  in  good 
correspondence  with  selected  peaks  in  the  power 
spectra.  In  the  next  section,  we  consider  the  Sinai 
billiard,  which  consists  of  a  nominal  square 
ballistic  quantum  dot  with  a  circular  anti-dot  in 
the  center  and  is  another  structure  whose  classical 
analog  is  chaotic.  Recent  experiments  [7]  have 
shown  self-similar  (fractal)  behavior  in  the  mag¬ 
netoconductance  of  such  structures,  appearing  in  a 
series  of  peaks  in  the  resistance.  Not  only  do  our 
simulations  show  indications  of  self-similarity  on 
scales  similar  to  that  observed  experimentally,  but 
that  this  behavior  can  in  fact  coexist  with  the 
periodic  scarring  effect  associated  with  regular 
orbits.  The  paper  concludes  with  a  summary. 


2.  METHOD  OF  CALCULATION 

As  displayed  in  Figure  1,  the  typical  situation  is 
one  in  which  ideal  quantum  wires,  which  extend 
outward  to  ±oo  are  connected  to  the  quantum  dot, 
a  stadium  in  this  case.  This  problem  can  be  solved 
by  using  an  iterative  method  [6]  applied  to  the 
discretized  version  of  the  Schrodinger  equation, 
obtained  by  keeping  terms  up  to  first  order  in  the 
approximation  of  the  derivative: 

(EF  —  Hy)'0y  +  Hyj_l'0y_I  +  Wj  =0  (l) 

where  ^  is  a  M-dimensional  vector  containing  the 
amplitudes  of  the  j  th  slice.  The  problem  is  solved 
on  a  square  lattice  of  lattice  constant  a  with  the 


FIGURE  1  The  geometry  of  one  of  the  quantum  dots  consi¬ 
dered  in  this  study.  The  grid  represents  the  underlying  mesh  on 
which  the  calculations  are  performed,  though  in  practice  the 
grid  is  much  finer  that  that  shown  here. 

wires  extending  M  lattice  sites  across  in  the  x 
direction  and  the  region  of  interest  being  broken 
down  into  a  series  of  slices  along  the  y  direction.  In 
this  equation,  the  H,  matrices  represent  Hamilto¬ 
nians  for  individual  slices  and  the  matrices  H {jj_\ 
and  H jj+\  give  the  inter-slice  coupling.  By 
approximating  the  derivative,  the  kinetic  energy 
terms  of  Schrodinger’s  equation  get  mapped  onto 
a  tight-binding  model  with  t  =  —H2  jlnfa 2  repre¬ 
senting  nearest  neighbor  hopping.  The  potential 
simply  adds  to  the  on  site  energies.  Assuming  a 
flux  incident  from  the  left,  this  equation  can  be 
used  to  derive  a  transfer  matrix  which  allows  us  to 
translate  across  the  system  and  thus  calculate  the 
transmission  coefficients  which  enter  the  Land- 
auer-Buttiker  formula  to  give  the  conductance. 
Transfer  matrices  however  are  made  unstable  due 
to  exponentially  growing  and  decaying  contribu¬ 
tions  of  the  evanescent  modes.  This  difficulty  can 
be  overcome  by  performing  some  clever  matrix 
manipulations  and  calculating  the  transmission  by 
a  iterative  procedure  rather  than  just  multiplying 
transfer  matrices  together  [6].  The  amplitudes  of 
the  wave  functions  at  specific  values  of  x  and  y  can 
be  found  easily  by  backward  substitution  after  the 
iteration  is  performed. 

3.  THE  STADIUM 

The  correspondence  between  periodicities  in  the 
conductance  and  scarring  effects  was  first  sug¬ 
gested  as  a  possibility  in  the  context  of  stadium 
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shaped  quantum  dots  by  Marcus  et  al.  [3].  Pro¬ 
vided  an  ensemble  average  was  not  performed, 
they  found  that  a  Fourier  analysis  of  their 
conductance  fluctuations  revealed  the  presence  of 
strong  peaks  at  a  few  discrete  frequencies.  The 
stadium  is  of  particular  interest,  since  its  closed, 
classical  analog  is  well  known  to  be  chaotic  (it 
should  be  noted  that  a  closed  quantum  stadium 
structure  has  also  been  achieved  [7]).  As  such,  one 
might  expect  the  behavior  to  reflect  ergodicity,  or 
phase  space  filling.  Such  behavior  is  only  recovered 
after  ensemble  averaging  over  many  conductance 
traces,  each  obtained  by  altering  the  gate  voltage 
[3,  4].  Importantly,  such  averaging  has  the  effect  of 
returning  the  dot,  and  its  point  contact  leads,  to 
the  classical  regime,  since  the  quantization  is 
averaged  out,  and  the  proper  quantum  behavior 
is  masked. 

The  results  of  a  Fourier  analysis  of  a  single 
conductance  trace  are  shown  in  Figure  2(a)  as 
curve  (a),  taken  from  Ref.  [3],  In  the  inset  is  a 
micrograph  of  the  actual  dot  on  which  the 
experiments  were  performed.  As  shown,  the  input 
and  output  leads  were  at  right  angles  to  each 
other.  The  dot  size  was  approximately  0.4xl.0  pm 
and  the  electron  density  was  3.6  x  1011  cm-2. 
Curve  (b)  is  the  Fourier  power  spectrum  obtained 
from  our  simulation  of  a  similar  sized  dot  (after 
accounting  for  depletion).  Both  the  experiment 
and  the  simulation  show  strikingly  similar  harmo¬ 
nic  content,  with  well  defined  peaks  at  f  ~  20,  33 
and  65  T-1  in  both  cases.  For  the  simulation,  four 
modes  were  allowed  to  propagate  in  the  leads.  In 
addition  to  the  periodicity,  we  see  scarring  effects. 
Figure  2(b)  shows  a  representative  example,  a 
wave  function  scarred  by  an  underlying  “bow-tie” 
shaped  classical  orbit,  which  occurs  at  i?  =  0.229  T. 
Unfortunately,  while  other  scars  do  appear,  it  is 
difficult  to  establish  a  connection  between  any  scar 
and  a  particular  power  spectrum  peak,  as  similar 
looking  scars  can  be  resolved  only  at  a  very  few 
other  values  of  field. 

Figure  3  shows  an  example  in  which  the  stadium 
has  centrally  aligned  leads,  dimensions  of  0.4  pm  x 
0.8  pm  and  an  electron  density  of  4xlOncm"2. 
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FIGURE  2  In  panel  (a),  the  Fourier  power  spectrum  of  the 
conductance  fluctuations  is  plotted.  The  curve  labeled  (a)  is  for 
the  experimental  stadium  dot  shown  in  the  inset  (from  Ref.  [3], 
with  permission).  Curve  (b)  is  the  result  of  the  quantum 
simulation.  Panel  (b)  show  a  “bow-tie”  scar  that  appears  in  the 
stadium  wave  function.  Probability  density  (|^(x,y)|2)  versus  x 
and  y  plotted  here,  with  higher  amplitude  corresponding  to 
darker  shading. 


FIGURE  3  Conductance  fluctuations,  <5g,  are  plotted  as  a 
function  of  magnetic  field  for  a  stadium  quantum  dot  with 
aligned  leads.  The  right  inset  shows  the  corresponding  power 
spectrum.  The  left  inset  shows  a  “whispering  gallery”  scar  that 
recurs  at  the  points  indicated  by  the  solid  circles. 

Here  we  plot  the  conductance  fluctuations  for  this 
configuration  and  the  corresponding  power  spec¬ 
trum  is  shown  in  the  left  inset.  A  single  peak  at  47 
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T-  Clearly  dominates.  The  corresponding  mag¬ 
netic  field  period  is  0.021  T,  in  good  agreement 
with  the  spacings  between  successive  minima  in 
the  fluctuations.  The  solid  circles  correspond  to 
values  of  B  where  a  “whispering-gallery”  scar  was 
observed  in  the  wave  function.  An  example  of  such 
a  scar  is  shown  in  the  left  inset,  which  corresponds 
to  5  =  0.288  T.  The  quantization  of  modes  in  the 
contacts  causes  the  electrons  to  be  injected  in 
collimated  beams  directed  at  well  defined  angles 
[2],  and  it  is  this  effect  that  is  responsible  for 
exciting  the  orbit  seen  here.  The  “whispering- 
gallery”  scars  appear  at  very  nearly  periodic 
intervals,  with  the  period  also  being  0.021  T,  thus 
a  strong  correspondence  can  be  made  between  a 
particular  power  spectrum  peak  and  a  periodic 
orbit  in  this  case. 


4.  THE  SINAI  BILLIARD 

As  mentioned  in  the  introduction,  the  Sinai 
billiard  is  another  example  in  which  the  classical 
dynamics  is  expected  to  be  chaotic,  at  least  in  the 
closed  structure.  The  experimental  observation  of 
“self-similarity”  in  the  magnetoresistance  of  such 
structures  has  been  recently  reported  [8].  Self¬ 
similar  or  fractal  behavior  has  long  been  asso¬ 
ciated  with  chaos.  We  have  performed  simulations 
of  Sinai  billiard  structures,  with  configurations 
similar  to  the  experimental  ones,  with  a  quantum 
dot  1.0  jam  square,  an  anti-dot  in  the  center,  two 
leads  in  the  bottom  left-hand  size  corner  (refer  to 
Fig.  4(c))  and  an  electron  density  of  2.3  x  101 1  cm“2. 
Figure  4(a)  shows  a  resistance  trace  obtained  from 
such  a  simulation,  with  the  leads  adjusted  to 
support  four  propagating  modes,  and  the  central 
anti-dot  approximately  0.2  pm  in  radius.  We  show 
this  example  because,  first  of  all,  it  shows 
indications  of  the  self-similar  behavior  seen  experi¬ 
mentally.  A  blow-up  of  the  central  region  of 
Figure  4(a),  is  displayed  in  Figure  4(b).  Note 
that  the  basic  form  of  two  broad  humps  occurs  on 
both  scales.  The  peak  of  the  hump  in  (a)  occurs  at 
~  10  mT  and  in  (b)  at  ~0.5  mT,  indicating  a 


FIGURE  4  (a)  A  resistance  trace  obtained  from  a  simulation 
of  a  Sinai  billiard,  with  input  and  output  leads  are  in  the  bottom 
left  hand  corner  of  the  structure  (these  are  seen  clearly  in  (d)). 
(b)  A  blow-up  of  the  central  region  of  shown  in  (a),  (c)  The 
scarred  wave  function  at  B=  16  mT,  which  corresponds  to  the 
resonance  indicated  by  the  arrow  in  the  top  frame,  (d)  Another 
diamond  scar  from  the  periodic  series,  this  one  at  B  =  49  mT. 


scaling  factor  of  ~  20,  which  is  in  agreement  with 
that  observed  experimentally  [8].  Secondly,  despite 
this  apparent  self-similar  behavior,  this  example 
also  shows  periodic  scarring  effects  similar  to  that 
observed  for  stadium.  Figure  4(c)  shows  the 
scarred  wave  function  5=16mT,  which  corre¬ 
sponds  to  the  resonance  indicated  by  the  arrow  in 
Figure  4(a).  Note  the  distorted  diamond-shaped 
periodic  orbit  reflected  by  the  wave  function. 
Similar  scarred  wave  functions  appear  at  intervals 
of  ~  8  mT.  Figure  4(d)  shows  another  diamond 
scar  from  this  periodic  series,  this  one  at  B  =  49  mT. 
As  in  the  stadium,  the  excitation  of  this  parti¬ 
cularly  periodic  orbit  can  be  attributed  in  large 
part  due  to  the  collimation  effect  on  the  input 
leads.  In  fact,  similar  scars  have  been  observed  in 
simple  square  quantum  dots  [2].  What  is  interest¬ 
ing  however  in  the  Sinai  billiard  is  that,  the 
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circular  anti-dot  in  the  center  can  in  fact  cause  a 
“shadow”  effect,  which  can  make  configuration 
appear  more  regular  than  the  simple  regular  dot. 
This  is  most  apparent  when  the  collimated  beam 
exiting  the  left  contact  just  grazes  the  anti-dot  and 
is  indicated  by  the  periodic  appearance  of  many 
well  defined  scars.  For  example,  in  a  0.3  pm  square 
dot,  four  examples  of  a  diamond  shaped  orbit  were 
observed  at  periodic  intervals,  with  the  periodic 
scarring  effects  becoming  much  weaker  as  the  dot 
is  made  larger  [2].  In  contrast,  with  a  much  larger 
Sinai  billiard  dot,  we  have  seen  the  diamond 
pattern  recur  as  many  as  six  times.  We  shall  give  a 
much  fuller  account  of  these  results  elsewhere. 


5.  SUMMARY 

Despite  the  fact  that  the  closed,  classical  analogs 
of  the  structures  we  have  studied  are  know  to  be 
chaotic,  we  have  found  that  regular  periodic  orbits 
strongly  influence  the  magneto-transport  of  both 
the  stadium  and  Sinai  billiard  structures.  Evidence 
for  these  orbits  is  seen  in  scarred  wave  functions, 
which  can  recur  periodically  in  magnetic  field.  In 
both  cases,  the  quantization  of  modes  in  the  leads 
is  crucial  in  exciting  the  particular  orbits  evident  in 
the  scarred  wave  functions,  as  this  quantization 
causes  the  electrons  to  be  injected  in  collimated 
beams  directed  at  well  defined  angles.  For  the 
stadium  in  particular,  the  scarring  periodicities 
show  good  correspondence  with  power  spectrum 
peaks,  providing  strong  evidence  that  the  periodi¬ 
cities  evident  in  the  magneto-conductance  fluctua¬ 
tions  are  in  fact  tied  to  specific  periodic  orbits. 
Moreover,  we  obtain  power  spectra  that  agree  well 
with  the  experimental  observations.  In  the  Sinai 
billiard,  we  see  indications  of  the  self-similar 
behavior  observed  experimentally  coexisting  with 
the  periodic  scarring  behavior.  The  actual  origin  of 
the  self-similar  behavior  in  this  structure  requires 
further  study.  However,  in  this  regard,  it  should  be 
mentioned  that  the  results  of  a  semi-classical 
analysis  have  lead  to  the  prediction  of  fractal 
conductance  fluctuations  in  chaotic  cavities,  no¬ 


tably  for  cases  in  which  there  is  a  mixed  (chaotic 
and  regular)  phase  space  [9]. 
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Complete  RF  Analysis  of  Compound  FETs  Based 
on  Transient  Monte  Carlo  Simulation 
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In  this  paper  we  described  a  complete  methodology  to  extract  the  RF  performance  of 
‘real’  compound  FETs  from  time  domain  Ensemble  Monte-Carlo  (EMC)  simulations 
which  can  be  used  for  practical  device  design.  The  methodology  is  based  on  transient 
finite  element  EMC  simulation  of  realistic  device  geometry.  The  extraction  of  the 
terminal  current  is  based  on  the  Ramo-Shockley  theorem.  Parasitic  elements  like  the 
gate  and  contact  resistances  are  included  in  the  RF  analysis  at  the  post-processing  stage. 
Example  of  the  RF  analysis  of  pseudomorphic  HEMTs  illustrates  our  approach. 


Keywords:  Monte-Carlo,  RF  analysis,  compound  FETs,  simulation 


1.  INTRODUCTION 

The  remarkable  RF  performance  of  compound 
FETs  such  as  GaAs  MESFETs  and  InGaAs 
HEMTs  with  channel  lengths  down  to  0.1pm  is 
due  to  well  pronounced  velocity  overshoot.  The 
use  of  simulation  for  predictive  analysis  and 
design  of  such  devices  require  in  many  cases  the 
employment  of  full  scale  EMC  technique  [1-3]. 
However,  most  of  the  published  EMC  studies  of 
compound  FETs  consider  simplified  device  geo¬ 
metry  and  focus  mainly  on  the  transport  physics 
and  the  effect  of  the  enhanced  channel  velocity  on 
the  DC  device  characteristics.  Far  more  impor¬ 
tant  for  the  proper  design  of  modern  short 


channel  compound  FETs  is  the  RF  performance 
which  is  determined  not  only  by  the  high  field 
transport  but  also  by  the  device  geometry  and  the 
surface  effects.  The  T-  or  T-shaped  gate,  the  gate 
recess  and  the  passivation  in  such  devices 
critically  affect  the  device  parasitics  and  the 
overall  RF  device  performance. 

In  this  paper  we  describe  a  methodology  based 
on  the  EMC  simulation  to  investigate  the  RF 
performance  of  FETs.  The  terminal  currents  are 
estimated  using  the  Ramo-Shockley  theorem.  The 
device  parasitics  are  included  through  the  proper 
finite-element  description  of  the  gate  and  recess 
shapes.  The  external  parasitics  are  included  in  the 
post-processing  stage. 
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2.  TRANSIENT  CURRENT 

The  Heterojunction  2D  Finite  Element  simulator 
(H2F)  and  its  Monte-Carlo  module  are  described 
in  detail  elsewhere  [4,  5].  In  the  MC  simulation  the 
total  transient  terminal  current  in  response  to  a 
step  change  in  the  applied  voltages  required  for  the 
^-parameter  extraction  is  the  sum  of  the  particle 
current  and  the  displacement  current.  According 
to  the  Ramo-Shockley  theorem  [6]  the  instanta¬ 
neous  transient  current  in  electrode  i  due  to  N 
discrete  moving  charges  within  the  device  is  given 
by  the  sum  of  the  current  77(f)  solely  contributed 
by  the  movement  of  the  N  charged  particles  with 
fixed  potentials  at  electrodes  and  the  current 
induced  due  to  the  time-varying  potentials  of  the 
electrodes  through  the  capacitive  coupling  through 
the  electrodes.  The  current  77(f)  is  given  by 

=  0-v/,  (i) 

j=  1 

where  qj  is  the  charge  of  the  super-particle,  y,(f)  is 
the  velocity  of  the  particle  and  ft  is  the  solution  of 
the  Laplace  equation  V-(eV/})  =  0  with  a  unit 
voltage  applied  to  electrode  i,  while  all  other 
electrodes  are  grounded. 

The  current  I"{t)  associated  with  the  time 
varying  potential  is  calculated  from  the  capaci¬ 
tance  matrix  components  Cy  associated  with  the 
electrodes  of  the  simulated  device.  The  capacitance 
matrix  components  are  obtained  from  the  solution 
of  the  Laplace  equation  as  Gy  =  AQi/AVj  where 
A Qi  is  the  change  in  the  electrode  charge  in 
response  to  a  change  in  the  potential  of  a 
particular  electrode.  When  a  step  perturbation 
AVj  of  the  terminal  voltage  is  applied  I'!(t)  flows 
only  during  the  first  time  step  At  and  is  given  by: 

m  =  c.,%  <2) 

The  displacement  current  during  the  remaining 
part  of  the  transient  is  related  to  the  charges 
induced  on  the  electrode  by  the  moving  particles  in 


the  device  associated  with  the  redistribution  of  the 
mobile  charge  and  is  accounted  for  by  Eq.  (1). 


3.  RF  ANALYSIS 

The  flow  diagram  of  the  complete  time  domain  RF 
EMC  analysis  is  given  in  Figure  1 .  The  ^-para¬ 
meters  are  calculated  by  Fourier  decomposition  of 
the  current  transients  obtained  in  response  to  step 
perturbations  in  the  terminal  voltages  [7].  The  cut¬ 
off  frequency  of  the  simulated  device  ff*  is 
extracted  by  solving  log[G*m(log/)]  =  0  where 
Gc  =  dl^jdlg  is  the  current  gain  expressed  as  a 
function  of  y-parameters. 

In  order  to  extract  the  maximum  frequency  of 
oscillation  of  the  simulated  device  /™x  the 
y-parameters  are  transformed  into  ^-parameters 
Ssim.  The  maximum  frequency  of  oscillation 


yreai  grea> 


;real 

max 


FIGURE  1  Flow  diagram  of  the  RF  EMC  analysis. 
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is  then  extracted  by  solving  log[M^Gsim(log/)]  =  0 
where  MAGsim  is  the  maximum  available  gain. 

For  typical  MESFET  and  HEMT  simulation 
domains,  we  have  adopted  the  equivalent  circuit 
model  presented  in  Figure  2.  The  gate  resistance, 
the  contact  resistances  and  any  external  inductive 
components  are  excluded  from  the  EMC  simula¬ 
tion.  The  source  Rs\  and  the  drain  Rd\  resistances 
in  Figure  2  represent  the  resistance  of  the  regions 
between  the  gate  and  the  source  and  drain  ohmic 
contacts  respectively.  To  extract  accurately  the 
small  signal  equivalent  circuit  the  ^-parameters  of 
the  simulated  device  Ysimare  transformed  into  z- 
parameters  Zsim.  The  estimated  source  and  drain 
resistances  Rs\  and  Rd\  are  subtracted  from  Zsim  to 
obtain  the  z-parameters  Zmt  of  the  ‘intrinsic’ 
device.  Finally  Zint  are  transformed  back  into  y- 
parameters  Ymt  from  which  the  components  of  the 
‘intrinsic’  small  signal  circuit  can  be  analytically 
extracted  [7]: 

In  order  to  evaluate  the  cut-off  frequency  / ^eal 
and  the  maximum  frequency  of  oscillations  / ^®al  of 
the  ‘real’  device  the  gate  resistance  Rg ,  the  contact 
resistances  Rc  and  eventually  the  inductive  com¬ 
ponents  Lg ,  Ls  and  Ld  first  have  to  be  incorporated 
in  the  z-parameters  of  the  real  device  Zreal  which 
are  then  transformed  into  Yreal.  From  Yreal  and 
the  figures  of  merit  of  the  ‘real’  device  we  can 
estimate  /£eal  and  /^aI.  The  intrinsic  minimum 
noise  figure  is  also  evaluated  from  the  two-port  y- 
parameters  and  the  current  traces. 
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FIGURE  2  Small  signal  equivalent  circuit  of  a  real  FET. 
Figure  also  shows  the  ‘intrinsic*  and  ‘simulated*  device 
equivalent  circuit. 


4.  RESULTS 

We  apply  the  described  RF  analysis  in  the 
simulation  of  a  120  nm  T-gate  InGaAs  channel 
pHEMT  with  22  nm  gate  to  channel  separation  [8], 
The  ^-parameters  extracted  from  the  Fourier 
decomposition  of  the  transients  are  shown  in 
Figures  3  a,b.  Table  I  summarises  the  small  signal 


Frequency  (GHz) 


Frequency  (GHz) 

FIGURE  3  7-parameters  as  a  function  of  frequency  extracted 
from  the  Fourier  decomposition  of  the  current  transients  at 
Vg=-0.2  and  Frf=1.5V. 
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TABLE  I  Small  signal  equivalent  circuit  components  calculated  from  the  EMC  simulations,  Vg  =  -0.2  and  Vd=  1.5  V.  Given  also 
are  the  experimental  values.  fT  and  /max  are  given  at  Rg  =  Rc  =  5  Q. 


Cgs(fF) 

Cgd(fF) 

gmo(ms) 

gds(fF) 

Ri(ft) 

r(pS) 

fT(GHz) 

fmax(GHz) 

MC 

75.5 

14.2 

87.6 

11.2 

2.3 

0.17 

142 

164 

Exp. 

77.8 

8.73 

■al 

67 

8.65 

3.75 

0.32 

114 

150 

FIGURE  4  Effect  of  gate  and  contact  resistances  on  fmSLX. 


FIGURE  5  Intrinsic  noise  figure  for  the  120nm  pHEMT 
extracted  from  the  current  traces  at  Vg=  —  0.2  and  1.5  V. 


equivalent  circuit  parameters  extracted  from  the 
EMC  simulations,  Vg=-0.2  and  Vd=\.5\ ,  to¬ 
gether  with  the  experimental  values.  The  higher 
simulation  gmo  and  fT  values  is  due  to  the  fact  that 
the  MC  simulations  overestimate  the  velocity 
overshoot  in  the  channel.  The  significant  effect 
the  gate  and  contact  resistances  have  on  the 
maximum  frequency  of  oscillation  is  depicted  in 
Figure  4.  The  intrinsic  noise  figure  extracted  from 
the  transients  is  shown  in  Figure  5. 

5.  CONCLUSION 

In  this  paper  we  have  presented  a  comprehensive 
methodology  for  the  RF  analysis  of  FETs.  The 
methodology  is  based  on  the  transient  EMC 
simulations  of  the  intrinsic  device  followed  by  a 
post-processing  stage  during  which  the  parasitic 
elements  are  included.  It  allows  realistic  estimation 
of  the  RF  performance  from  MC  simulations.  The 
capabilities  of  the  scheme  are  illustrated  in 
example  simulations  for  the  120nm  gate  length 
pHEMT. 
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In  this  paper  we  present  a  methodology  to  use  drift  diffusion  (DD)  simulations  in  the 
design  of  short  channel  heterojunction  FETs  (HFETs)  with  well  pronounced  velocity 
overshoot.  In  the  DD  simulations  the  velocity  overshoot  in  the  channel  is  emulated  by 
forcing  the  saturation  velocity  in  the  field  dependent  mobility  model  to  values 
corresponding  to  the  average  velocity  in  the  channel  obtained  from  Monte  Carlo  (MC) 
simulation.  To  illustrate  our  approach  we  compare  enhanced  DD  and  MC  simulation 
results  for  a  pseudomorphic  HEMTs  with  0.12  pm  channel  length,  which  are  in  good 
agreement.  The  usefulness  of  the  described  methodology  is  illustrated  in  a  simulation 
example  of  self  aligned  gamma  gate  pseudomorphic  HEMTs.  The  effect  of  the  gamma 
gate  shape  and  the  self  aligned  contacts  on  the  overall  device  performance  has  been 
investigated. 

Keywords:  Numerical  simulation,  overshot,  heterojunction  FETs,  drift-diffusion  approach 


1.  INTRODUCTION 

Commercial  device  simulators  like  MEDICI  [1] 
and  BLAZE  [2]  are  flexible,  fast  and  work  in  a 
user  friendly  environment.  Employing  finite  ele¬ 
ment  approach  they  can  describe  accurately  the 
complex  geometry  of  modern  short  recess  gate 
heterojunction  FETs  (HFETs).  Interface  charge, 
surface  states,  and  deep  levels  can  be  eventually 
included  in  the  simulations  and  the  self  heating  can 
be  treated  through  coupling  to  the  heat  flow 
equation.  The  transient  algorithms  are  stable  and 


allow  for  large  time  steps.  Frequency  domain 
analysis  is  readily  available.  External  circuit 
elements  like  contact  and  gate  resistances,  pad 
inductances  and  capacitances  can  be  included  in 
the  simulations  and  in  the  rf  analyses.  Unfortu¬ 
nately  the  drift-diffusion  (DD)  approach,  which  is 
at  the  heart  of  such  simulators,  cannot  predict  the 
velocity  overshoot  responsible  for  the  high  perfor¬ 
mance  of  many  short  channel  HEFTs  [3],  The 
hydrodynamic  (HD)  options  offered  as  extensions 
to  the  DD  engines  of  the  above  simulators  deal 
with  the  overshoot  but  slow  down  the  simulations 
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and  often  have  convergence  and  parameter  in- 
dentification  problems.  The  ensemble  Monte 
Carlo  (MC)  method,  usually  implemented  in  in- 
house  software  [4],  is  still  computationally  expen¬ 
sive  and  cannot  match  all  the  features  of  the  DD 
commercial  simulators,  particularly  their  speed. 

In  this  paper  we  describe  a  methodology  for 
using  MC  calibrated  DD  simulation  with  en¬ 
hanced  channel  velocity  in  the  design  and  optimi¬ 
sation  of  short  channel  HFETs  including  HEMTs 
and  strained  Si  channel  SiGe  MODFETs.  The  DD 
simulations  are  calibrated  with  respect  to  our  finite 
element  Monte  Carlo  simulator  H2F  [5].  The 
usefulness  of  this  approach  is  illustrated  in 
simulation  examples  of  self  aligned  gamma  gate 
pseudomorphic  HEMTs. 


Figure  1  illustrates  the  velocity  along  the 
channel  of  a  0.12  p  gate  length  pseudomorphic 
HEMT  with  22  nm  gate-to-channel  separation,  50 
nm  recess  offset  and  delta  doping  in  the  supply 
layer.  The  device  is  described  in  more  details 
elsewhere  [7].  The  velocity  profile  obtained  from 
DD  MEDICI  simulation  with  Vy  forced  to 
2.8xl07cm/s,  /x0=  5000  cm2v_1s_1and  b  =  2 
matches  well  with  the  MC  velocity  profile  in  the 
source  region  and  in  the  channel.  In  the  drain 
region,  where  most  of  the  particle  in  the  MC 
simulation  are  in  the  L-valley,  the  DD  simulation 
overestimates  the  velocity  and  hence  underesti¬ 
mates  the  drain  resistance.  This  however  does  not 
affect  seriously  the  simulated  rf  performance  since 
the  drain  resistance  has  usually  a  weak  influence 
on  the  measured  and  extracted  ^-parameters. 


2.  EMULATION  OF  THE  OVERSHOOT 

The  enhanced  DD  approach  is  based  on  the 
observation  from  MC  simulations  that  in  many 
short  channel  (0.1 -0.2  pm  gate)  HFETs  the 
velocity  overshoot  extends  along  the  whole  high 
field  channel  region.  The  average  velocity  profile  in 
the  channel  obtained  from  MC  simulation  can  be 
emulated  by  increasing  the  saturation  velocity  v,  in 
a  simple,  three  parameter,  silicon-type  field  depen¬ 
dent  mobility  model  [6]  with  velocity  v  is  given  by 


where  E  is  the  electric  field,  p0  is  the  low  field 
mobility  and  I)  is  a  model  parameter.  The 
calibration  is  done  by  adjusting  the  three  para- 
metes  po>  vs  and  b  in  order  to  match  the  MC 
velocity  profile  in  the  source  and  in  the  channel 
region  for  a  bias  point  in  the  middle  of  the  uselful 
part  of  the  device  characteristics.  In  most  of  the 
cases  this  is  enough  to  achieve  satisfactory  agree¬ 
ment  between  the  MC  and  the  DD  simulation 
results  over  the  whole  range  of  applied  voltages  of 
interest. 


3.  COMPARISON  OF  DD  AND  MC  DC 
AND  RF  RESULTS 

In  order  to  justify  the  use  of  the  enhanced  DD 
approach  in  practical  short  channel  HFET  device 
simulation  and  design  we  compere  the  dc  and  the 
rf  results  obtained  from  both  DD  and  MC 
simulations.  The  comparison  is  based  on  the  same 


x ,  pm 

FIGURE  1  Average  velocity  in  the  channel  of  a  0.12  jim 
pHEMT  obtained  from  DD  and  MC  simulation  at  VG  =  0  V 
and  VD=  1.5  V. 
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pseudomorphic  HEMTs  described  in  Section  2. 
Exactly  the  same  simulation  geometry  is  used  in 
both  the  DD  MEDICI’s  and  the  MC  H2F’s 
simulations.  The  source  and  drain  contact  resis¬ 
tances  which  cannot  be  directly  included  in  the 
MC  simulation  are  also  excluded  from  the  DD 
simulations.  The  velocity  profile  is  calibrated  at 
one  dc  bias  point  corresponding  to  VG  =  0V  and 
VD=  1.5 Y.  The  dc  ouput  characteristics  are 
compared  in  Figure  2  and  are  in  remarkably  good 
agreement  in  both  the  low  and  in  the  high  drain 
volatge  regions.  This  is  partially  due  to  the  fact 
that  the  deconfinement  which  cannot  be  properly 
treated  in  the  DD  simulation  does  not  affect 
seriously  the  operation  of  the  simulated  pseudo¬ 
morphic  HEMTs  with  relatively  deep  (In0.3- 
Ga0.7As)  channels. 

The  comparison  of  the  rf  results  obtained  form 
DD  and  MC  simulation  is  based  on  time  domain 
transient  technique  which  is  the  only  available 
choice  in  the  MC  case  [8].  Complex  two  port  y- 
parameters  are  extracted  by  Fourier  transforming 
the  gate  and  drain  current  transients  in  response  to 
small  changes  in  the  gate  and  drain  voltages.  The 
frequency  dependence  of  the  real  and  the  imagin¬ 
ary  parts  of  yn  and  yi2  is  plotted  in  Figure  3.  The 
y-parameters  extracted  from  the  DD  and  MC 
transient  simulations  are  in  good  agreement.  This 


FIGURE  2  Comparison  between  DD  and  MC  simulated 
output  characteristics  of  a  0.12  pm  pHEMT. 
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FIGURE  3  Comparison  between  Y  parameters  extracted 
from  DD  and  MC  simulation  of  a  0.12  pm  pHEMT  at  VG  = 
-  0.2  V  and  VD-  1.5  V.  The  channel  width  is  100  pm. 

agreement  holds  also  for  the  small  signal  circuit 
elements  and  the  cut-off  frequency  fT  extracted 
from  the  y-parameters  and  for  the  maximum 
frequency  of  oscillations  /max  extracted  after 
transforming  the  y-parameters  into  ^-parameters. 

4.  SELF  ALIGNED  GAMMA  GATE 
EXAMPLE 

Calibrated  DD  simulations  can  be  used  confi¬ 
dently  to  investigate  device  geometry  effects  which 
do  not  affect  strongly  the  lateral  field  profile  in  the 
channel  but  are  important  for  the  rf  device 
performance  including  the  gate  and  contact 
shapes,  self-alignment,  cap  layer  modification, 
recess  design  etc.  When  the  lateral  field  profile  is 
altered,  for  example  as  a  result  of  channel  length 
scaling,  the  DD  simulations  have  to  be  recali¬ 
brated  with  respect  to  a  new  MC  calculation. 

We  illustrate  the  described  DD  approach  in  the 
simulation  of  self  aligned  gamma  gate  pHEMTs. 
The  vertical  layer  structure  of  these  devices  is  the 
same  as  the  structure  described  in  Section  2.  The 
profile  of  the  gamma  gate  and  the  corresponding 
solution  domain  are  outlined  in  Figure  4.  In  the 
DD  simulation  we  investigate  the  effect  of  the  gate 
overlap  dc_r  on  the  drain  side  of  the  channel  on  fT 
and  /max.  The  results  are  presented  in  Figure  5. 
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FIGURE  4  Self  aligned  gamma  gate  pHEMT. 


design  itself  is  a  trade  off  between  the  reduced  gate 
resistance  and  the  increased  gate-to-drain  parasitic 
capacitance. 


5.  CONCLUSIONS 

The  velocity  overshoot  in  the  channel  of  modern 
short  gate  length  HFETs  can  be  satisfactory 
emulated  in  DD  simulations  by  increasing  the 
saturation  velocity  in  a  simple  silicon  type  field 
dependent  mobility  model.  Usuallly  calibration  of 
the  DD  simulations  to  one  properly  selected  bias 
point  of  the  corresponding  MC  simulations  leads 
to  good  agreement  in  the  DD  and  MC  dc  and  rf 
characteristics  over  a  wide  range  of  applied 
voltages.  Calibrated  DD  simulations  can  be  used 
to  speed  up  the  design  of  modern  short  channel 
HFETs  where  geometry  related  parasitic  effects 
complete  with  the  enhanced  channel  transport  in 
determining  the  overall  device  performance. 


FIGURE  5  Dependence  of  fT  and  /max  on  the  gate  overlap 
dc-r  for  the  self  aligned  gamma  gate  pHEMT  illustrated  in 
Figure  4  with  channel  width  is  100  pm,  contact  resistance  1  O 
and  gate  resistance  5  U.  DD  simulations. 


In  the  self  aligned  technology  a  trade-off  between 
the  reduced  series  resistances  and  the  increased 
gate-to-contacts  parasitic  capacitances  determines 
the  over  all  device  performance.  The  gamma  gate 
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RF  Performance  of  Si/SiGe  MODFETs: 
A  Simulation  Study 

S.  ROY,  A.  ASENOV*,  S.  BABIKER  ,  J.  R.  BARKER  and  S.  P.  BEAUMONT 

Nanoelectronics  Research  Centre,  Department  of  Electronics  and  Electrical  Engineering , 
Glasgow  University,  Glasgow  G12  8LT,  Scotland,  UK 

The  microwave  performance  potential  of  Si/SiGe  pseudomorphic  MODFETs  are 
studied,  in  comparison  to  state  of  the  art  InGaAs  pseudomorphic  HEMTs.  Both  devices 
have  equivalent  structures  corresponding  to  a  physical  HEMT  used  for  calibration.  We 
use  an  RF  analysis  technique  based  on  transient  Monte  Carlo  simulations  to  estimate 
the  intrinsic  noise  figures,  the  RF  figures  of  merit  fT  andfmSLXi  and  the  effect  of  contact 
and  gate  resistances.  Both  devices  exhibit  velocity  overshoot  below  the  gate  region.  It  is 
shown  that  the  difference  in  noise  figures  and  /t  values  can  be  mainly  attributed  to 
differences  in  device  channel  velocity.  /max  exhibits  a  strong  dependence  on  device 
contact  resistance,  eroding  some  of  the  performance  advantage  of  the  pseudomorphic 
HEMT. 

Keywords:  Si/SiGe,  heterostructures,  Monte  Carlo,  microwave  performance,  RF  analysis 


INTRODUCTION  doped  field  effect  transistors  (MODFETs)  based 

on  this  material  system  have  been  demonstrated, 
Recent  theoretical  and  experimental  studies  show  and  show  significant  potential  for  RF  applications 

that  low  field  mobility  and  velocity  overshoot  are  [2, 3].  Although  measured  mobilities  are  lower 

enhanced  in  Si  layers  grown  pseudomorphically  on  than  those  reported  in  optimal  III-V  based 

relaxed  SiGe  substrates.  Induced  strain  breaks  the  devices,  the  compatibility  between  SiGe  and 

six-fold  degeneracy  of  the  Si  conduction  band,  conventional  Si  processing  technology  makes  such 

resulting  in  an  improved  band  offset  for  the  two  MODFETs  attractive  for  Si  MMICs  design  and 

conduction  valleys  whose  transverse  effective  mass  microwave  signal  processing  applications  inte- 

is  in  the  plane  of  the  heterojunction.  This  increases  grated  on  conventional  Si  chips.  Therefore, 

the  in  plane  effective  mobility  and  reduces  inter-  comparison  of  strained  Si  channel  devices  with 

valley  scattering  in  the  Si  layer  [1].  Modulation  well  established  members  of  the  III-V  family  will 
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provide  greater  insight  into  their  potential  perfor¬ 
mance  and  limitations. 

As  part  of  this  goal,  we  here  study  the  micro- 
wave  performance  potential  of  a  Si/SiGe  pseudo- 
morphic  MODFET  in  comparison  with  a  state  of 
the  art  InGaAs  channel  pseudomorphic  HEMT, 
using  an  RF  analysis  technique  based  on  transient 
Monte  Carlo  simulations. 


DEVICE  STRUCTURES 

AND  PERFORMANCE  ANALYSIS 

To  allow  a  fair  comparison  similar  device  layer 
structures  are  considered  (Figs.  la,b).  Both  devices 
have  5-doping  separated  by  a  2.5  nm  spacer  from 
the  channel,  a  T  shape  recess  gate  with  50  nm 
recess  offset  and  heavily  doped  (w-type  1018cm"3) 
cap  layers.  An  effective  5-doping  of  5x  1012 cm"2  is 
considered  in  the  both  cases.  Total  gate-to-channel 
separation  is  22  nm  and  the  gate  length  is  0.12  pm. 
p-type  background  substrate  doping  of  10I4cm"3 
is  considered.  This  corresponds  to  the  dimensions 
of  real  pseudomorphic  HEMT  fabricated  at  the 
Glasgow  Nanoelectronics  Research  Centre  and 
used  for  validation  and  calibration  of  the  RF 
Monte  Carlo  analysis  technique  (described  in  an 
accompanying  paper  [4]).  We  realise  that  for  the 
Si/SiGe  MODFET  this  may  pose  unresolved 
growth  problems.  Specifically  we  conjecture  im- 
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FIGURE  1  Vertical  layer  structure  of  (a)  a  0.12  nm 
pseudomorphic  HEMT  and  (b)  a  0.12  pm  pseudomorphic 
MODFET.  Both  devices  have  the  same  vertical  dimensions. 


provements  in  growth  technology  and  low  tem¬ 
perature  processing  to  allow  formation  of  a  well 
defined  As  5-doping  supply  layer.  For  simplicity  of 
Monte  Carlo  modelling  a  uniform  SiGe  substrate 
is  assumed,  instead  of  a  ‘virtual  substrate’  with 
graded  Ge  concentration.  The  ‘etch  stop’  region  of 
the  Si/SiGe  device  is  also  considered  as  (undoped) 
SiGe.  Simple  1-D  Poisson  calculations  indicate 
that  if  a  strained  Si  ‘etch  stop’  region  is  included,  it 
will  exhibit  negligible  parallel  conductance  at  a  d.c. 
bias  of  VG  =  -0.25  V. 

RF  analysis  of  HEMT  and  MODFET  perfor¬ 
mance  is  based  on  Monte  Carlo  simulation  of 
device  transient  response.  We  follow  Yamada’s  [1] 
treatment  of  the  effect  of  the  strain  on  the  Si 
channel  band  structure,  with  conduction  band 
splitting  of  A Ec  =  0.67x  (eV)  and  band  gap  Eg  = 
1-11  —  0.74x  (eV).  However  the  six  phonon  scatter¬ 
ing  model  of  Jacoboni  [5]  is  implemented,  instead 
of  the  four  phonon  model  in  [1].  Acoustic  and 
ionised  impurity  scattering  modes  are  also  in¬ 
cluded,  with  ionised  impurity  scattering  calculated 
from  the  Brooks-Herring  model  [6].  Bulk  velocity- 
field  characteristics  obtained  from  the  model  are 
in  agreement  with  experimental  unstrained  Si  data 
at  77  K  and  300  K  and  previous  results  for  strained 
Si  [1,7].  The  transient  Monte  Carlo  simulations 
begin  by  following  5xl04  superparticles  for  2ps 
settling  time  at  d.c.  bias,  then  a  further  2.0— >2.6  ps 
during  which  device  statistics  are  recorded  at  1  fs 
intervals.  A  step  change  AVG  =  0.2  V,  or  AFD  = 
0.3  V  is  then  consecutively  applied,  and  the 
transient  response  measured  for  a  further  2  ps.  It 
is  found  that  structure  and  doping  dependant  THz 
oscillations  in  device  drain  current  (possibly  due  to 
plasma  oscillations  in  the  channel,  or  in  the  heavily 
doped  cap  layer)  may  mask  the  detailed  form  of  the 
transients,  and  so  a  number  of  traces  are  averaged  to 
define  the  response.  Complex  j-parameters  are 
derived  by  Fourier  transforming  these  terminal 
current  transients,  and  used  to  extract  the  small 
signal  equivalent  circuit,  intrinsic  noise  figures,  and 
estimate  the  RF  performance  figures  of  merit 
fr  (cutoff  frequency)  &ndfmax  (maximum  frequency 
of  oscillation).  Finally,  the  small  signal  equivalent 
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circuit  is  augmented  by  the  addition  of  external 
impedance,  and  thus  the  effect  of  contact  and  gate 
resistance  on  the  ‘real’  device  operation  estimated. 
A  detailed  description  of  the  analysis  process  is 
given  in  [4]. 


RESULTS  AND  DISCUSSION 

The  average  velocity  in  the  channel  of  both  devices 
is  compared  in  Figure  2  for  a  d.c  bias  of  VD  =  1.5  V 
and  Fg  =  -0.25  V.  In  the  pseudomorphic  HEMT 
this  corresponds  to  the  region  of  maximum 
transconductance,  while  in  the  pseudomorphic 
MODFET  it  is  the  region  of  maximum  transcon¬ 
ductance  achieved  while  constraining  parallel 
conduction  in  the  <5-doped  region  to  less  than 
10%  of  total  conduction.  The  gate  extends  from 
x=  -0.12  — >0.0 pm.  Both  devices  show  distinct 
overshoot  below  the  gate  region.  Peak  velocity  in 
the  pseudomorphic  MODFET,  at  the  drain  end  of 
the  strained  Si  channel,  approaches  twice  the 
saturation  velocity  of  bulk  unstrained  Si.  The 


ratio  between  peak  velocity  in  the  MODFET 
compared  to  that  of  the  pseudomorphic  HEMT 
InGaAs  channel  is  2.8. 

Figures  3-5  and  Table  I  characterise  the  RF 
performance  of  the  two  devices.  The  intrinsic  noise 
figures  NF  (in  decibels)  are  shown  in  Figure  3.  From 
the  definition  of  NF  we  postulate  that  the  significant 
difference  between  the  devices  will  be  in  their 
transconductance  gmo  =  Y2\.  All  other  terms  -  y- 
parameters  and  current  fluctuations  <SI2>  -  we 
therefore  represent  by  an  approximate  constant  C. 


NF  —  1  + 


'In TiM  > 
y2i\  <sq> 


From  Figure  3  and  equation  (1)  the  ratio  of 
transconductances  ^hemt/^modfet  is  indeed 
found  to  be  approximately  constant  below  60 
GHz,  with  value  2.6.  The  transconductance  values 
directly  obtained  from  the  Monte  Carlo  simula¬ 
tions  are  ^hemt  ~  87  and  £modfet  =  31,  with  ratio 
2.8.  It  can  be  concluded  that  the  bulk  of  the 


FIGURE  2  Comparison  between  average  velocity  in  the 
channel  of  0.12  pm  pseudomorphic  HEMT  and  MODFET. 
Kd  =  1.5V,  with  VG  =  -0.25  V  corresponding  to  maximum 
transconductance. 


FIGURE  3  Comparison  between  minimum  intrinsic  noise 
figures  for  pseudomorphic  HEMT  and  MODFET  as  a  function 
of  frequency.  VD  =  1 .5  V,  with  VG  ~  -0.25  V. 
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Source,  Drain  Contact  Resistance  (Q) 

FIGURE  4  Variation  of fT  in  a  pseudomorphic  MODFET  as 
a  function  of  source  and  drain  contact  resistance  (/T  invariant 
to  changes  in  gate  resistance). 


FIGURE  5  Variation  of  /max  in  a  pseudomorphic  MODFET 
as  a  function  of  gate  and  contact  resistances. 


difference  in  noise  figures  between  the  two  devices 
can  be  attributed  to  the  differences  in  transcon¬ 
ductance  -  and  thus  to  differences  in  channel 
velocity. 


TABLE  I  Calculated  RF  figures  of  merit  for  pseudomorphic 
HEMT  and  MODFET  at  bias  KD=1.5V,  VG=  -0.25V. 
Device  width  of  100  pm  is  assumed.  /T,  /max  calculated  for 
both  negligible  external  gate  and  drain  contact  resistance,  and 
for  more  realistic  contact  resistances  of  5Q 

RF  figure  of  Pseudomorphic  Pseudomorphic 

merit  (GHz) _ HEMT _ MODFET 

with  negligible  contact  resistance 
fT  192  77.4 

/max  634  161 

gate,  source  &  drain  resistances  all  of  5  H 
/T  156  63.9 

/max  169  68.0 


Table  I  lists  the  RF  figures  of  merit  extracted 
from  the  transient  response  of  each  device.  The 
ratio  of  cut-off  frequencies  /T  for  the  pseudo¬ 
morphic  HEMT  and  MODFET  is  approximately 
2.5  and  independent  of  external  resistance.  This 
implies  that  the  value  of  /T  is  primarily  governed 
by  the  channel  velocity  of  the  respective  device. 
However,  the  ratio  of /max  for  the  pseudomorphic 
HEMT  and  MODFET  is  3.9  for  intrinsic  devices, 
and  drops  to  2.5  when  external  contact  resistances 
of  50  are  applied.  A  more  realistic  inclusion  of 
device  parasitics  in  this  case  reduces  the  advan¬ 
tages  of  the  pseudomorphic  HEMT.  Finally, 
Figures  4  and  5  detail  the  effect  of  varying  contact 
resistance  on  both  figures  of  merit  for  the 
pseudomorphic  MODFET,  and  show  clearly  the 
strong  dependence  of /max  on  device  parasitics. 

CONCLUSIONS 

The  microwave  performance  potential  of  a  Si/SiGe 
pseudomorphic  MODFET  was  studied,  in  com¬ 
parison  to  a  state  of  the  art  InGaAs  channel 
pseudomorphic  HEMT.  We  used  an  RF  analysis 
technique  based  on  transient  Monte  Carlo  simula¬ 
tions,  calibrated  to  a  physical  HEMT.  The 
difference  in  device  noise  figures  was  attributed 
mainly  to  differences  in  device  transconductance, 
and  thus  to  differences  in  channel  velocity.  The 
cut-off  frequency  /T  was  also  shown  to  be 
governed  by  channel  velocity.  However  the  max- 
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imum  frequency  of  oscillation  /max  was  seen  to 
exhibit  a  strong  dependence  on  device  contact 
resistance,  eroding  some  of  the  advantage  of  the 
pseudomorphic  HEMT  over  the  Si/SiGe  MOD- 
FET. 
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Ab-initio  Coulomb  Scattering  in  Atomistic 
Device  Simulation 
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As  devices  shrink  to  dimensions  below  0.1  pm,  it  becomes  essential  to  treat  impurities 
and  carriers  as  individual  charges.  We  describe  some  approaches  to  ionised  impurity 
scattering  where  the  potential  of  the  impurities  is  included  directly  in  the  dynamics  of  a 
Monto  Carlo  type  simulation  rather  than  as  a  scattering  rate.  The  divergence  in  the 
Coulomb  potential  creates  difficulties  for  mesh-based  solutions  of  Poisson’s  equation, 
which  we  have  compared  with  more  accurate  Ewald  summation.  However,  we  find  that 
the  mesh  does  not  introduce  significant  errors,  and  reproduces  well  the  expected 
mobility  as  a  function  of  doping.  Highly  accurate  integration  of  the  equation  of  motion 
is  needed  for  free  carriers,  and  the  initial  distribution  is  problematic.  In  contrast,  a 
simple  treatment  of  phonon  scattering  by  Brownian  dynamics  is  more  tolerant  of  errors 
because  it  tends  to  restore  the  system  to  equilibrium. 


Keywords:  Semiconductors,  discrete  charges,  numerical  simulation,  Monte  Carlo,  mobility 


1.  INTRODUCTION 

As  semiconductor  devices  shrink  to  dimensions 
below  0.1  pm,  their  properties  begin  to  differ  from 
larger  structures  as  the  atomistic  nature  of  the 
charges  begins  to  exert  its  influence  [1].  The 
simulation  of  such  devices  requires  a  full-scale 
3D  treatment,  including  both  impurities  and 
carriers  as  individual  particles  [2, 3].  This  presents 
a  severe  computational  load,  both  in  the  solution 
of  Poission’s  equation  and  because  the  equation  of 
motion  must  be  integrated  through  a  complex 


potential  landscape.  Rapid  variation  of  the  poten¬ 
tial  in  space  may  create  difficulties  with  mesh- 
based  calculations,  both  for  the  dynamics  and 
Poisson’s  equation. 

In  this  paper  we  discuss  some  issues  associated 
with  the  ab-initio  treatment  of  ionised  impurity 
scattering,  where  the  potential  of  the  impurities  is 
included  directly  in  the  dynamics  of  a  Monte  Carlo 
simulation  rather  than  being  introduced  as  a 
scattering  rate.  We  have  studied  several  methods 
for  the  solution  of  Poisson’s  equation  and  the 
integration  of  the  equation  of  motion  to  evaluate 
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their  efficiency  and  accuracy.  For  Poisson’s  equa¬ 
tion  we  compared  a  mesh-based  solution  with  a 
near-exact  potential  based  on  Ewald  summation 
[4].  The  equation  of  motion  was  integrated  either 
with  a  high-order  adaptive  Runge-Kutta  scheme, 
for  free  carriers  and  impurities  alone,  or  with  a 
low-order  Euler  method  within  Brownian  dy¬ 
namics  [5],  which  also  includes  phonons  and  proves 
to  be  more  forgiving. 

These  methods  were  tested  by  calculating  the 
mobility  of  electrons  in  a  slab  with  randomly 
distributed  impurities.  Their  application  is  also 
illustrated  with  the  simulation  of  a  0.08  pm 
channel  length  dual  gate  GaAs  MESFET. 

2.  COMPUTATIONAL  ASPECTS 

The  simplest  approach  to  include  individual 
impurity  charges  in  a  particle-based  device  simula¬ 
tion  is  to  ascribe  them  to  a  mesh  and  to  solve 
Poisson’s  equation  as  a  boundary  value  problem. 
This  method  is  efficient  and  works  for  devices  with 
arbitrary  boundaries.  Unfortunately,  difficulties 
arise  close  to  charges  where  the  1/r  Coulomb 
potential  diverges.  The  errors  in  calculating  the 
interparticle  forces  from  a  mesh-based  solution  of 
Poisson’s  equation  along  the  major  axis  of 
symmetry  of  a  cubic  mesh  cell  are  illustated  in 
Figure  1.  The  errors  are  significant  when  the 
distance  between  the  charges  becomes  smaller  than 
2-3  mesh  spacings.  It  is  practical  to  correct  the 
short-range  part  of  the  interaction  estimated  from 
the  mesh  analytically.  For  a  3D  solution  of  Pois¬ 
son’s  equation  on  an  uniform  mesh  we  use  a  simple 
linear  interpolation  for  the  corrections  which 
reflects  the  symmetry  of  the  mesh  cell. 

Direct  analytic  evaluation  of  the  Coulomb 
forces  between  point  charges  can  give  accurate 
results  but  becomes  to  costly  when  large  numbers 
of  particles  are  involved  in  the  simulation,  or 
where  the  boundaries  of  the  device  are  complex. 
For  simple  cases,  with  periodic  boundary  condi¬ 
tions,  the  method  of  Ewald  summation  [4]  can  be 
used.  We  have  used  a  polynomial  approximation 


Separation  (mesh  spacings) 

FIGURE  1  The  forces  between  two  point  charges  on  along 
the  main  axes  of  symmetry  of  a  cubic  mesh  cell.  The  true 
Coulomb  force  has  been  shown  for  comparison. 

to  the  Ewald  sums  with  fitted  coefficients  and 
compared  the  results  with  mesh-based  solution  of 
Poisson’s  equation. 

Integration  of  the  equations  of  motion  in 
the  complex  potential  landscape  associated  with 
the  individual  charges  poses  problems  because  the 
magnitude  of  the  Coulomb  force  varies  rapidly 
near  the  impurities.  This  necessitates  the  use  of 
high-order  integration  schemes  and  small  time- 
steps  for  free  carriers.  In  this  case  we  use  an 
adaptive  Runge-Kutta  approach.  Errors  accumu¬ 
late  during  the  simulation  and  tight  tolerances  are 
therefore  required. 

Phonon  scattering  can  be  included  using  Brow¬ 
nian  dynamics  [5].  This  is  essentially  an  ensemble 
Monte  Carlo  method  with  a  greatly  simplified 
scattering  term.  Low-order  Euler  integration  of 
the  equation  of  motion  is  satisfactory  in  this  case 
because  the  interaction  with  the  phonons  built  into 
Brownian  dynamics  tends  to  erase  errors  in  the 
integration  of  forces. 

3.  IONISED  IMPURITY  SCATTERING 

The  methods  discussed  above  have  been  tested  by 
calculating  the  mobility  of  electrons  in  bulk  GaAs 
at  room  temperature.  The  crystal  was  represented 
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as  a  periodic  array  of  cubic  cells  and  ionised 
impurity  scattering  was  introduced  through  the 
potential  of  randomly  distributed  impurities  in 
each  cell.  A  uniform  background  charge  was  added 
to  maintain  overall  neutrality.  Poisson’s  equation 
was  solved  using  a  multigrid  technique  for  the 
mesh-based  methods.  A  typical  pontential  distribu¬ 
tion  in  a  (50  nm)3  cell  is  given  in  Figure  2.  The  same 
positions  of  impurities  were  used  in  the  Ewald 
summation  approach. 

The  mesh  spacing  must  not  exceed  the  distance 
at  which  the  Coulomb  interaction  equals  the 
thermal  energy  -  about  2.8  nm  at  room  tempera¬ 
ture.  In  our  calculations  we  used  a  1  nm  mesh.  It  is 
natural  to  assume  that  the  cell  size  has  to  be  larger 
than  the  mean  free  path  of  the  carriers.  For  doping 
concentration  ND  =  1024  m-3  the  mean  free  path  is 
approximately  50  nm  and  a  mesh  with  more  than 
503  points  is  therefore  needed.  This  introduces  a 
problem  for  lower  concentrations  where  the  mesh 
has  to  be  even  larger.  However,  experiments  for 
Nd=  1024  m-3  with  meshes  ranging  from  103  to 
2003  points  gave  a  very  small  spread  in  the 
calculated  mobility,  (in  =  0.417 ±0.008 m2/Vs. 
The  history  of  65000  particles  was  followed  in 
the  simulation  and  the  diffusion  coefficient  was 
calculated  from  the  variance  in  the  range  using  a 


Threshold  equipotential 


FIGURE  2  Random  potential  in  a  uniformly  doped  slab  near 
the  percolation  threshold. 


least-squares  fit.  The  mobility  was  then  deduced 
from  the  Einstein  relation. 

The  results  are  summarised  in  Figure  3  and 
compared  with  the  most  commonly  used  analytical 
formulas.  Phonon  scattering  was  excluded  from  the 
Brownian  simulations  using  Mattheisen’s  rule 
where  necessary.  The  Brownian  dynamics  simula¬ 
tion  with  a  mesh-based  solution  of  Poisson’s 
equation  is  in  good  agreement  with  the  Conwell- 
Weisskopf  formula  over  the  whole  range  of  doping. 
Brownian  dynamics  in  combination  with  the  more 
accurate  Ewald  sum  for  Poisson’s  equation  gives 
lower  mobility  than  the  mesh-based  results.  The 
results  for  the  Ewald  summation  and  free  carrier 
dynamics  show  a  marked  reduction  in  mobility 
which  is  more  pronounced  as  the  doping  level  is 
increased.  The  most  likely  cause  of  this  pheno¬ 
menon  is  the  trapping  of  carriers  in  the  potential 
well  at  the  site  of  each  impurity.  The  lack  of  phonon 
scattering  means  that  is  no  mechanism  for  the 
electrons  to  exchange  energy  with  the  crystal  and 
reach  thermal  equilibrium.  It  is  therefore  important 
to  impose  the  correct  initial  distribution  of  carriers. 
The  use  of  an  unmodified  Maxwell -Boltzmann 
distribution  to  initiate  the  simulation  would  pose 
severe  problems  because  it  predicts  that  all  carriers 
are  trapped  in  the  Coulomb  wells. 
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4.  DUAL  GATE  MESFET  EXAMPLE 

The  Brownian  method  on  a  mesh  was  used  in  a 
simple  simulation  of  an  80  nm  dual-gate  MESFET 
with  N/)  =  5xl023m-3  in  the  channel.  Figure  4 
illustrates  the  potential  in  the  channel  near  thresh¬ 
old;  fluctuations  due  to  the  random  donors  are 
evident.  Figure  5  shows  that  the  results  of  this 
atomistic  simulation  agree  well  with  drift-diffusion 
results  from  MEDICI,  at  low  source-drain  vol¬ 
tage.  The  slope  of  the  7/>(FG)  characteristic  is 
proportional  to  the  mobility,  and  shows  that 
impurity  scattering  is  properly  included  in  the 
Brownian  simulation. 


5.  CONCLUSIONS 

We  have  considered  different  approaches  to  the 
solution  of  both  Poisson’s  equation  and  the 
equation  of  motion  in  a  simulation  where  ionized 
impurities  are  treated  as  discrete  random  charges. 
We  conclude  that  the  simpler  mesh-based  solution 
of  Poisson’s  equation  does  not  introduce  signifi¬ 


FIGURE  4  Equipotential  surfaces  in  the  channel  of  80  nm 
channel  dual  gate  GaAs  MESFET  at  threshold. 


VgM 


FIGURE  5  Transfer  characteristics  of  an  80  nm  channel  dual 
gate  GaAs  MESFET,  comparing  the  “atomistic”  Brownian 
simulations  and  drift-diffusion  MEDICI  simulations  at  low 
drain  voltage  (VD  =  0.05V). 


cant  errors,  and  reproduces  well  the  expected 
mobility  as  a  function  of  doping.  Highly  accurate 
integration  of  the  equation  of  motion  is  needed  for 
free  carriers,  but  the  interaction  with  phonons  in 
the  Brownian  method  makes  it  more  tolerant  of 
errors. 
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We  introduce  an  iterative  scheme  to  solve  the  drift-diffusion  device  simulation  problem, 
which  combines  the  Gummel  iteration  with  the  “pointwise  iteration”,  and  then  we 
compare  its  convergence  behavior  with  other  iteration  strategies,  for  different  test  cases. 
Comparisons  are  made  with  the  standard  Gummel  approach  and  a  nonlinear  multigrid 
iteration.  The  combined  “Gummel-Pointwise”  iteration  has  significantly  better 
convergence  characteristics  than  the  Gummel  iteration  in  all  cases.  The  cost  of  the 
pointwise  iteration  varies  only  linearly  with  the  number  of  grid  points.  While  in  terms  of 
CPU  time  for  the  solution  the  pointwise  iteration  is  not  always  faster  in  2-D,  in  3-D  the 
combined  technique  becomes  more  and  more  advantageous  as  the  grid  size  increases.  It 
is  found  that  the  nonlinear  multigrid  scheme  is  overall  not  as  effective  as  the  combined 
iteration. 


Keywords:  Drift-diffusion,  iterative  methods,  gummel  iteration 


1  INTRODUCTION 

A  major  problem  for  the  development  of  drift- 
diffusion  solvers  for  3-D  applications  is  the 
difficulty  in  adapting  existing  2-D  schemes.  The 
Gummel  and  Newton  iteration  methods  are  most 
commonly  used  to  solve  the  nonlinear  steady-state 
problem  arising  from  the  three  semiconductor 
device  equations  [1],  (Poisson  equation,  electron 


continuity  and  hole  continuity,  in  the  order) 

V2^  =  _i{p_n  +  N+_N-)  (!) 

VJ„  — ?*  =  0  (2) 

V-Jp  +  qR  =  0  (3) 

where  all  symbols  have  their  usual  meaning. 
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In  the  case  of  Gummel  iteration,  the  equations 
are  solved  in  a  decoupled  manner.  Poisson 
equation  is  solved  at  all  grid  points,  followed  by 
the  electron  continuity  equation,  and  then  by  the 
hole  continuity  equation,  repeating  the  procedure 
until  convergence.  In  Newton’s  method  the  three 
device  equations  are  solved  simultaneously  using  a 
linearization  procedure.  The  Gummel  iteration 
always  maintains  the  coupling  between  grid 
points,  arising  from  discretization,  while  the 
equations  are  loosely  coupled  through  successive 
updates  of  the  variables  «  and  p.  In  the  Newton 
iteration,  both  the  coupling  between  neighbouring 
grid  points  and  that  between  the  three  equations  is 
maintained  during  each  iteration. 

In  practice,  the  Gummel  method  is  found  to  be 
efficient  if  the  coupling  between  the  equations  is 
weak;  otherwise,  the  convergence  may  become 
quite  slow.  The  Newton  method  is  more  robust, 
and  converges  in  relatively  few  iterations,  provided 
that  the  initial  guess  is  sufficiently  close  to  the 
solution.  The  major  drawback  of  Newton’s 
method  is  that  it  involves  a  matrix  of  size 
3Nx3N  (where  N  is  the  number  of  grid  points), 
which  increases  both  memory  requirements  and 
CPU  time  at  each  iteration.  These  constraints  are 
particularly  severe  in  three  dimensions  if  a  direct 
method  for  matrix  solution  is  to  be  used. 


2  NUMERICAL  APPROACH 

Several  strategies  have  been  reported  to  circum¬ 
vent  the  limitations  of  Gummel’s  and  Newton’s 
iterations  [2-7],  such  as  approaches  for  accelera¬ 
tion  of  convergence  in  Gummel’s  scheme,  reduc¬ 
tion  of  memory  requirements  in  Newton’s  scheme, 
and  so  on.  The  purpose  of  this  paper  is  to  explore 
alternative  iteration  schemes  to  solve  the  device 
equations,  and  to  evaluate  their  performance 
through  numerical  experiments. 

A  complementary  approach  to  Gummel  itera¬ 
tion  is  represented  by  the  so  called  pointwise 
iteration.  Here,  the  three  equations  are  solved 
simultaneously  at  a  given  grid  point,  treating  the 


variables  at  adjacent  points  as  constants,  and  this 
procedure  is  repeated  for  all  grid  points.  Thus,  the 
coupling  between  the  equations  is  maintained  at 
each  grid  point,  whereas  the  grid  points  are 
coupled  to  each  other  only  loosely  through  the 
update  of  values  at  the  neighbors. 

The  pointwise  iteration  does  not  need  a  large 
matrix  to  be  stored  as  the  equations  are  solved 
simultaneously  one  point  at  a  time;  this  makes  the 
iteration  very  attractive  from  the  memory  point  of 
view.  This  procedure  has  been  used  as  the 
smoothing  iteration  in  nonlinear  multigrid  solu¬ 
tion  of  the  device  equations  [9-12].  Also,  in  the 
Alternate  Block  Factorization  [7]  and  ILU/Knot 
[6]  methods,  the  idea  of  coupling  the  three 
equations  at  each  grid  point  has  been  used, 
although  not  explicitly  as  an  iteration. 

In  our  numerical  experiments  we  have  used 
rectangular  grids,  and  it  is  convenient  to  adopt  a 
red-black  reordering  as  in  [12],  where  the  mesh 
points  are  arranged  as  red  and  black  squares  on  a 
checker  board.  The  three  equations  are  solved  for 
all  red  points  first,  followed  by  all  black  points.  At 
each  grid  point,  we  have  three  equations  in  three 
unknowns  (^,  n ,  and  p ).  In  our  specific  imple¬ 
mentation,  we  first  perform  a  few  Gummel-type 
iterations.  If  the  residuals  are  not  smaller  than  a 
specified  tolerance,  at  the  end  of  the  Gummel-type 
iterations,  we  switch  to  Newton’s  method  on  the 
3x3  system  to  solve  the  equations  at  that  grid 
point. 

Since  the  Gummel  and  the  Pointwise  iterations 
have  complementary  aspects,  it  is  interesting  to 
combine  the  two  approaches  to  examine  the  effect 
on  the  convergence  rate.  This  combined  iteration, 
which  we  call  the  “GPM”  iteration  consists  of  the 
following:  (a)  perform  one  Gummel  iteration,  (b) 
perform  M  times  the  pointwise  iteration.  As  we 
will  show  later  through  numerical  examples,  this 
scheme  can  significantly  accelerate  the  conver¬ 
gence. 

To  complete  the  comparison,  we  have  also 
implemented  a  NonLinear  MultiGrid  method 
(NLMG)  as  described  in  [12].  Here,  the  pointwise 
iteration  described  above  is  used  for  smoothing, 
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full  weighting  for  the  restriction  of  residuals, 
injection  for  the  restriction  of  solution,  and 
bilinear  interpolation  for  transferring  error  from 
coarse  to  fine  level.  We  have  implemented  V-cycle 
with  i/i  =  2  and  v2  =  1  where  v\  and  v2  are  the 
number  of  pre-smoothing  and  post-smoothing 
steps  respectively.  For  the  NLMG  iteration  to 
converge,  the  coarse  level  cannot  be  arbitrarily 
coarse  as  pointed  out  in  [12].  For  our  examples 
with  a  limited  grid  size,  we  found  that  the  iteration 
did  not  converge  with  three  levels,  therefore,  only 
two  levels  have  been  used.  Two  approaches  can  be 
used  to  solve  the  problem  on  the  coarsest  level:  (a) 
The  three  equations  may  be  solved  simultaneously 
at  all  grid  points  using  Newton’s  method  as  in  [12]. 
(b)  A  large  number  (typically  50  to  100)  of 
smoothing  iterations  may  be  performed  to  solve 
the  coarse-grid  problem  approximately.  This 
approach  is  also  employed  in  [10]  although,  in 
that  work,  the  grid  points  are  scanned  in  a 
different  order.  Both  approaches  have  been  tried 
here.  We  will  refer  to  one  V-cycle  as  one  Non- 
Linear  MultiGrid  (NLMG)  iteration,  NLMG  with 
Newton’s  method  at  the  coarse  level  as  NLMG 
(N)  and  NLMG  with  m  smoothing  iterations  at 
the  coarse  level  as  NLMG  (raS). 


3  SIMULATION  RESULTS 

Because  of  space  limitations,  we  present  here  only 
two  examples  of  2-D  simulation,  involving  a 
silicon  p+n  junction  with  a  uniform  substrate  n- 
doping  of  1017  cm-3)  and  a  Gaussian  diffusion  for 
the  p+  region  with  peak  density  5xl018  cm-3  at 
the  contact,  and  a  3-D  example  with  a  «  +  n  with 
the  same  doping  levels.  To  facilitate  the  compar¬ 
ison,  we  use  here  a  uniform  grid  with  33x33  points 
in  2-D  and  17x17x17  in  3-D.  Similar  tests 
conducted  with  non-uniform  rectangular  grids 
produced  very  similar  behavior. 

For  the  2-D  simulation  we  consider  a  situation 
of  large  forward  bias  (0.8  V)  where  the  solution  for 
0.6  V  forward  bias  is  used  as  the  initial  guess 
(Figs.  1(a)  and  1(b)),  and  also  a  reverse  bias  of 


FIGURE  1  2-norm  of  residual  versus  iteration  number  for  2D 
p+n  device  with  a  forward  bias  of  0.8  V  and  the  solution  for 
0.6  V  as  the  initial  guess,  (a)  1:  Gummel,  2:  GPi,  3:  GP25. 
Damping  was  employed  for  the  first  5  iterations,  (b)  1:  NLMG 
(N),  2:  NLMG  (50  S),  3:  NLMG  (100  S). 


0.5  V  with  the  equilibrium  solution  as  the  initial 
guess  (Fig.  2).  It  is  clearly  seen  that  the  GP^ 
iteration  converges  in  significantly  fewer  iterations 
than  the  Gummel  iteration  in  all  cases.  For  the 
forward  bias  case,  GPi  and  NLMG  (N),  respec¬ 
tively,  are  the  most  efficient  in  terms  of  number  of 
iterations  required  for  convergence.  Note  also  that 
the  convergence  of  the  NLMG  (raS)  iteration  is 


FIGURE  2  2-norm  of  residual  versus  iteration  number  for  2D 
p+n  device  with  a  reverse  bias  of  0.5  V  and  equilibrium  initial 
guess.  Damping  was  employed  for  the  first  5  iterations. 
1:  Gummel,  2:  GPi. 
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FIGURE  3  2-norm  of  residual  versus  iteration  number  for  3D 
n+n  device  with  0.5  V  and  equilibrium  initial  guess.  1:  Gummel, 
2:  GP25.  Damping  was  employed  for  the  first  15  iterations. 


slow  even  for  m=  100  which  was  found  to  be  very 
effective  instead  in  other  simulations,  not  shown 
here,  involving  a  forward  biased  n  +  -n  junction. 
For  reverse  bias,  our  simple  NLMG  iteration  did 
not  converge.  We  found  that  the  coarse-grid 
problem  for  this  case  does  not  represent  the  fine- 
grid  problem  accurately  enough  because  the 
depletion  region  on  the  n  side  is  only  two  or  three 
grid  points  wide.  A  locally  adapted  multigrid 
method  [10]  is  more  appropriate  for  this  situation. 
In  terms  of  CPU  time  for  the  example  of  Figure  1(a), 
the  cost  per  iteration  on  the  33x33  grid  was  found  to 
be:  0.155  s  (Gummel);  0.118  s  (Pointwise);  0.714 
(NLMG(N)).  This  translates  into  total  solution 
times  of:  6.1  s  (Gummel);  6.8  s  (GP]);  32.0  s  (GP25); 
8.8  s  (NLMG(N));  24.0  s  (NLMG(50S));  22.0  s 
(NLMG(lOOs)).  This  shows  that  in  2-D  problems 
there  is  little  computational  advantage  in  using 
alternative  iteration  schemes. 

However,  as  drift-diffusion  problems  are  pushed 
into  3-D,  the  combined  Gummel-Pointwise  itera¬ 
tion  has  immediately  a  marked  advantage  over  the 
traditional  Gummel  iteration,  since  the  computa¬ 
tional  cost  of  the  pointwise  iteration  only  grows 
linearly  with  the  number  of  points.  As  a  final 
example,  we  consider  a  3D  n  +  n  structure,  with  a 
17x17x17  grid  which  is  uniform  in  each  direction, 
with  a  voltage  of  0.5  V  applied  to  the  n+  contact. 
The  equilibrium  solution  is  used  as  the  initial 


guess.  For  this  example,  Poisson’s  equation  was 
solved  using  the  multigrid  method  (correction 
scheme),  and  the  continuity  equations  were  solved 
using  YSMP  [8].  The  residual  for  the  Gummel  and 
the  GP25  iterations  are  shown  in  Figure  2.  Again, 
the  GP  iteration  converges  in  fewer  iterations  than 
Gummel.  But  while  GP25  requires  much  more 
CPU  time  to  solve  the  problem  for  a  similar  2-D 
n+n  structure  (83.0  s,  as  opposed  to  29.0  s  for 
Gummel  iteration),  in  the  3-D  case  the  Gummel 
solution  requires  1065.0  s,  and  the  GP25  solution 
only  595.0  s. 

We  conclude  that  the  proposed  Gummel-Point¬ 
wise  iterative  scheme  in  general  improves  the 
convergence  behaviour  of  the  Gummel  scheme 
significantly.  In  terms  of  CPU  time,  the  new 
scheme  is  advantageous  for  certain  2D  cases;  and 
for  3D  simulation,  it  is  expected  to  be  always  more 
effective  than  Gummel.  It  is  suggested  that 
implementation  of  the  pointwise  iteration  on  a 
parallel-architecture  computer  will  enhance  its 
effectiveness.  The  red  black  pointwise  iteration 
can  be  naturally  parallelized,  as  all  red  (or  black) 
points  can  be  processed  in  parallel  and  indepen¬ 
dently.  This  will  drastically  reduce  the  cost  of  the 
pointwise  iteration,  and  the  GP  iteration  will  then 
be  only  about  as  expensive  as  Gummel  iteration 
but  with  much  better  convergence  characteristics. 
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The  Mutation  Operator  Monte  Carlo  method  (MOMC)  is  a  new  type  of  Monte  Carlo 
technique  for  the  study  of  hot  electron  related  effects  in  semiconductor  devices.  The 
MOMC  calculates  energy  distributions  of  electrons  by  a  physical  mutation  of  the 
distribution  towards  a  stationary  condition.  In  this  work  we  compare  results  of  an  one 
dimensional  simulation  of  an  800  nm  Si-MOSFET  with  full  band  Monte  Carlo 
calculations  and  measurement  results.  Starting  from  the  potential  distribution  resulting 
from  a  drift  diffusion  simulation,  the  MOMC  calculates  electron  distributions  which  are 
comparable  to  FBMC-results  within  minutes  on  a  modem  workstation.  From  these 
distributions,  substrate  and  gate  currents  close  to  experimental  results  can  be  calculated. 
These  results  show  that  the  MOMC  is  useful  as  a  post-processor  for  the  investigation  of 
hot  electron  related  problems  in  Si-MOSFETs.  Beside  the  computational  efficiency,  a 
further  advantage  of  the  MOMC  compared  to  standard  MC  techniques  is  the  good 
resolution  of  the  high  energy  tail  of  the  distribution  without  the  necessity  of  any 
statistical  enhancement. 


Keywords:  Monte  Carlo  simulation,  hot  electrons 


1  INTRODUCTION 

During  the  last  two  years  we  have  developed  a  new 
approach  to  hot  electron  related  problems  in  Si- 
MOSFETs,  which  is  based  on  a  mixture  of 
Evolutionary  optimization  algorithms  (EA)  [1] 
and  Monte  Carlo  (MC)  technique  [2].  The  EA  is 
used  to  optimize  energy  distributions  of  electrons 


towards  an  agreement  with  measurement  results 
like  substrate  or  gate  currents.  A  major  problem  in 
this  quasi-backward  calculation  of  electron  dis¬ 
tributions  is  that  many  different  distributions  fit  to 
one  measured  current  value.  It  is  very  difficult  or 
even  impossible  to  force  the  EA  towards  physical 
correct  distributions  only  by  restrictions.  We 
overcome  this  problem  by  using  a  Physical 
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Mutation  Operator  (PMO)  within  the  EA.  The 
PMO  forces  the  EA  to  converge  to  distributions 
which  are  in  agreement  with  the  physical  model  of 
the  PMO.  The  PMO  itself  is  the  basis  for  a  novel 
type  of  Monte  Carlo  technique,  the  Mutation 
Operator  Monte  Carlo  method  (MOMC),  which 
applies  the  PMO  to  an  initial  distribution  repeat¬ 
edly,  until  a  stationary  condition  is  reached.  The 
effectiveness  of  the  EA  depends  greatly  on  the 
quality  of  the  PMO,  which  should  be  at  the  same 
time  accurate  and  inexspensive.  The  present  work 
focuses  on  specific  details  and  applications  of  the 
MOMC  itself.  The  approach  is  built  on  a  relatively 
simple  transport  model  which  is  calibrated  with  a 
traditional  Monte  Carlo  simulation.  We  would 
like  to  stress  that  since  the  MOMC  was  conceived 
as  the  first  approximation  step  for  an  EA 
approach,  it  cannot  be  as  accurate  in  all  details 
as  the  full  band  Monte  Carlo.  But  when  the 
parameters  of  the  MOMC  are  well  calibrated  from 
Monte  Carlo  simulation  results,  as  done  here,  the 
MOMC  can  then  be  used  as  stand-alone  method 
when  one  needs  a  rapid  approximate  evaluation  of 
hot  carrier  distribution.  Ideally,  this  method  should 
be  used  as  a  first  order  test,  to  select  conditions  for 
which  a  more  costly  Monte  Carlo  simulation 
should  be  run. 


2  MUTATION  OPERATOR  MONTE  CARLO 
METHOD 

The  main  part  of  the  MOMC  is  the  PMO,  which 
mutates  an  energy  distribution  of  electrons  by  a 
Monte-Carlo-like  simulation  of  the  electron  trans¬ 
port  (see  Fig.  1).  The  distribution  in  a  small 
interval  around  a  randomly  choosen  energy  Es tart 
and  position  xstart  is  associated  to  a  MC-electron. 
The  drift  and  scattering  processes  of  this  electron 
are  simulated  in  the  same  way  as  in  a  standard 
MC-simulation,  until  a  final  energy  £final  and 
position  xfinal  are  reached,  and  the  electron  density 
represented  by  the  MC-electron  is  returned  to  the 
distribution.  For  the  MC-simulation  of  the  elec¬ 
tron  movement  a  simple  one-valley  effective  mass 


approach  is  used,  taking  the  effective  mass  meff  as  a 
free  parameter.  Scatterings  due  to  phonon  absorp¬ 
tion,  phonon  emission  and  impact  ionization  are 
included  in  the  model.  The  phonon  scattering  rates 
are  calculated  by  a  full  band  approach  [3].  Instead 
of  many  rates  for  scattering  between  different 
valleys  one  averaged  scattering  rate  for  absorbtion 
and  one  for  emission  is  used  (see  [2]  for  details). 
For  impact  ionization  the  rate  computed  in  [4] 
was  taken  (assuming  zero  field  and  no  collision 
broadening).  The  phonon  scattering  and  the 
impact  ionization  rates  are  then  multiplied  by  the 
factors  /phonon  and  /mpa ct,  respectively.  Introduc¬ 
tion  of  these  two  parameters  gives  us  more 
flexibilty  to  calibrate  the  simple  model  by  compar¬ 
ison  with  full  band  approaches. 

The  Monte  Carlo  simulation  in  the  PMO  needs 
the  carrier  density  and  electric  field  or  potential 
profile  in  the  simulation  region  as  input  (along  the 
channel  in  case  of  Si-MOSFETs). 

Since  our  main  interest  is  to  use  the  MOMC  to 
calculate  the  high  energy  tail  of  the  distribution, 
which  has  only  a  small  impact  on  the  total  carrier 
distribution,  in  this  approach  the  electric  field  and 
the  carrier  density  are  taken  as  inputs  and 
Poisson’s  equation  is  not  solved  self-consistently. 

From  the  electron  distributions  calculated  by 
the  MOMC  substratecurrents  7sub  are  calculated 
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assuming  that  /sub  is  determined  by  the  amount  of 
holes  created  by  impact  ionization: 

\x)Wa  {E)dEdx, 

where  e  is  the  electron  charge,  Wa{E)  is  the  impact 
ionization  rate  and  fd(E,x )  the  energy  distribution 
of  electrons  integrated  over  the  depth  of  the 
simulated  region  (with  electron  distribution  fd(E, 
x)  we  denote  the  distribution  function  times 
density  of  states). 

The  gate  current  7gate  is  estimated  using  a  WKB 
approach  with  Schottky  lowering  of  the  barrier  [5]: 

fgate  =  ^hit  *  e  *  / 

./channel 

where  PtransC^  x)  is  the  probability  to  cross  the 
Si02-layer.  We  assumed  that  the  rate  at  which 
electrons  trying  to  cross  the  Si/Si02  interface  is 
proportional  to  the  carrier  density.  The  propor¬ 
tional  constant  Arhit  =  7.510_9s_1  was  calibrated 
by  a  comparison  with  FBMC  and  measurement 
results. 


/ 


fd(E ,  x)Ptrans(£,  x)dE dx, 


Tsub 


/  r 


3  RESULTS  AND  DISCUSSION 

In  this  work  we  present  results  from  an  one 
dimensional  simulation  of  a  0.8  pm  Si-MOSFET, 
for  which  experimental  data  were  published  in  the 
classic  paper  by  Ng  et  al.  [6].  The  input  for  the 
MOMC,  i.e.  the  carrier  density  and  electric  field 
profile,  was  calculated  by  a  Full  Band  Monte 
Carlo  (FBMC)  program  developed  at  the  Uni¬ 
versity  of  Illinois  and  by  a  HydroDynamic  (HD) 
model  (ISE  AG,  Zurich). 

Figure  2  shows  the  electron  distribution  given 
by  MOMC  and  FBMC  for  a  drain  voltage  Vd  =  6  V 
and  a  gate  voltage  Fg  =  7V  in  the  region 
important  for  substrate  and  gate  current  genera¬ 
tion,  i.e.  the  crossover  from  channel  to  drain  (the 
drain  region  starts  at  location  x  =  1  pm;  source  and 
substrate  contacts  are  held  at  0  V  bias).  MC- 
MOMC  denotes  the  MOMC  results,  which  were 


FIGURE  2  Energy  distribution  of  electrons  in  a  800  nm  Si- 
MOSFET(Fdrain  =  6 V,  Fgate  =  7V)  at  the  crossover  from 
channel  to  drain.  MOMC  results  calculated  from  FBMC  input 
(dashed  line,  MC-MOMC)  and  HD  input  (dotted  line,  HD- 
MOMC)  are  compared  with  FBMC  simulation  results  (solid 
line). 


obtained  using  the  FBMC  results  for  carrier 
density  and  electric  field  as  input,  and  HD-MOMC 
the  MOMC  results  calculated  on  the  basis  of  HD- 
model  input.  MOMC  results  calculated  from 
FBMC  input  are  in  fairly  good  agreement  with 
the  FBMC  results,  especially  at  the  crossover 
between  channel  and  drain  at  1  pm.  The  HD- 
MOMC  distribution  does  not  fit  the  FBMC  results 
as  well  but  is  reasonably  close  for  x  <  1  pm. 

The  difference  between  HD-MOMC  distribu¬ 
tions  and  FBMC  results  for  x<\  pm  can  be 
explained  by  looking  at  Figure  3.  Here,  the  electric 
field  and  drift  velocity  are  plotted  as  functions  of 
x.  The  HD-model  gives  a  smaller  rise  of  the  electric 
field  for  x  <  0.9  pm  and  then  a  steeper  increase  up 
to  the  peak  at  1.01  pm,  leading  to  a  later  heating  of 
the  electrons.  This  explains  the  lower  electron 
distribution  calculated  from  HD-MOMC  input  for 
x  <  1  pm.  The  drift  velocityprofiles  from  FBMC 
and  MC-MOMC  calculations  are  comparable. 
Both  simulations  yield  a  broad  peak  between 
0.92  pm  and  1.02  pm.  The  HD-MOMC  simulation 
leads  to  a  sharper  peak  at  x  =  0.99  pm  reflecting 
the  sharper  peak  of  the  electric  field. 

The  good  resolution  of  the  high  energy  tail  by 
the  MOMC  becomes  obvious  by  looking  at  the 
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FIGURE  3  Drift  velocity  and  electric  field  profile  along  the 
channel  for  an  800  nm  Si-MOSFET  (Fdrain“6  V,  Kgate  =  7  V) 
calculated  by  FBMC  simulation  (solid  lines)  and  MOMC 
simulation  using  FBMC  results  (dashed  lines)  or  HD-results 
(dotted  lines)  as  input. 

distributions  above  4  eV  (Fig.  2).  The  FBMC 
result  drops  to  zero  when  the  carrier  density  goes 
below  l-10  cm-2.  The  MOMC  resolves  this  regime 
with  the  same  resultion  as  the  low  energy  part. 
Theoretically,  the  resolution  of  the  MOMC  is 
independent  of  the  order  of  magnitudes  displayed. 
This  could  be  proven  by  bulk  simulations  up  to  80 
orders  of  magnitude.  A  statistical  enhancement, 
which  was  used  in  the  FBMC  simulation,  is  not 
necessary.  Therefore,  the  MOMC  could  also  be 
used  as  a  post-processor  for  standard  MC  simula¬ 
tions,  in  order  to  avoid  the  computational  com¬ 
plexity  connected  with  statistical  enhancement. 
Figure  4  compares  substrate  and  gatecurrents 
calculated  from  the  three  different  approaches 
with  the  measurement  results  of  Ng  et  al. 
Obviously  FBMC,  MC-MOMC  and  HD-MOMC 
give  an  excellent  agreement  between  measured  and 
calculated  substrate  currents.  For  gate  currents  the 
agreement  is  not  as  good,  showing  differences  up 
to  one  order  of  magnitude.  Interestingly  the  gate 
currents  calculated  from  FBMC  distributions  are 
not  closer  to  the  measurement  results  than  those 
from  MC-MOMC  and  HD-MOMC.  It  is  not 
possible  to  judge  from  the  comparison  with 
measurement,  which  simulation  approach  yields 
the  more  realistic  electron  distributions. 

The  above  results  demonstrate  that  the  MOMC 
gives  energy  distributions  of  electrons  which  are 


FIGURE  4  Comparison  of  substrate  (top  curves)  and  gate 
currents  (bottom  curves)  calculated  from  FBMC,  MC-MOMC 
and  HD-MOMC  results  with  experimental  data  from  Ng  et  al. 

[51- 

close  to  FBMC  results  and  in  agreement  with 
measurement  results  even  when  HD-model  calcu¬ 
lations  are  used  as  input.  For  each  ( Vd ,  Vg)- pair  in 
Figure  4,  the  MOMC  calculates  the  corresponding 
distribution  within  several  minutes  on  a  HP735 
workstation.  Thus,  the  computational  cost  is 
comparable  to  HD-simulation  and  significantly 
less  than  FBMC  calculations.  This  makes  the 
MOMC  a  very  interesting  post  processor  for  HD- 
simulations  to  gain  insight  on  information  like  the 
energy  distribution  of  carriers,  without  increasing 
the  computational  expense.  The  discussion  above 
on  the  difference  between  FBMC  and  HD-MOMC 
distributions  demonstrates  also  that  such  informa¬ 
tion  is  important  for  the  interpretation  of  simula¬ 
tion  results  like  the  effect  of  slight  changes  in  the 
electric  field  profile  on  the  electron  distribution 
and  related  hot  electron  effects. 


4  CONCLUSIONS 

A  comparison  of  MOMC  results  with  FBMC 
calculations  and  measurement  results  for  a  800  pm 
Si-MOSFET  shows  that  the  MOMC  can  be  used 
as  a  fast  post  processor  in  device  simulation.  The 
method  is  based  on  a  physical  mutation  operator 
which  changes  (mutates)  electron  distributions  by 
a  Monte  Carlo  like  simulation  of  the  electron 
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movement  using  a  simple  effective  mass  model 
including  electron-phonon  scattering  and  impact 
ionization.  Energy  distributions  of  electrons  cal¬ 
culated  by  the  MOMC  from  hydrodynamic  drift 
diffusion  calculations  were  found  to  be  compar¬ 
able  to  FBMC  results.  The  computational  cost  for 
the  MOMC  calculations  is  comparable  to  or 
smaller  than  the  cost  for  hydrodynamic  calcula¬ 
tions.  A  next  step  is  the  application  of  the  MOMC 
to  short  channel  MOSFETs  in  the  sub  100  nm 
range.  Further  improvements  of  the  physical 
model,  a  nonparabolic  approach  to  the  bandstruc- 
ture  and  additional  scattering  rates  are  planned. 
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This  paper  presents  a  comprehensive  physical  model  for  the  breakdown  process  in 
HEMTs.  The  model  is  integrated  into  in  a  fast  quasi-two-dimensional  HEMT  physical 
simulator.  The  work  is  based  on  a  full  study  of  the  complex  interactions  between  the 
different  breakdown  mechanisms  and  the  influence  of  design  parameters.  The  model 
takes  account  of  tunnelling  effects  in  the  region  of  the  gate  metallization,  and  of  the 
thermal  effects  in  the  active  channel  under  the  gate  region. 


Keywords:  Gate  tunneling,  thermal  modelling,  breakdown,  quasi-two-dimensional  modelling 


1.  INTRODUCTION 

Microwave  circuits  such  as  power  amplifiers 
operate  under  large-signal  conditions.  Their  ability 
to  perform  efficiently  is  limited  by  the  devices’ 
breakdown  characteristics,  which  limit  the  transis¬ 
tor’s  performance  and  power  output.  The  accuracy 
of  the  large-signal  design  relies  on  the  availability 
of  suitable  breakdown  models.  Popular  break¬ 
down  theories  have  not  been  adequate  to  indepen¬ 
dently  explain  the  full  picture  of  the  breakdown 
process  in  HEMTs.  The  effects  of  the  gate  leakage 
and  substrate  conduction  in  HEMTs  on  this  have 
not  been  simulated.  Analytical  models  for  ava¬ 


lanche  breakdown,  such  as  Frensley’s  [1],  were 
based  on  physical  simulations  of  the  active  channel 
around  the  gate.  The  effects  of  the  gate  leakage 
and  substrate  conduction  on  this  have  not  been 
included.  This  paper  considers  breakdown  more 
comprehensively  including  the  effects  of  reverse 
gate  conduction,  thermal  fluctuations  within  the 
active  region  and  substrate  conduction  [2]. 

2.  MODEL  DESCRIPTION 

The  Quasi-Two-Dimensional  (Q2D)  model  used 
here  is  based  on  the  earlier  work  of  Snowden  [3]. 
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The  Q2D  method  is  based  on  the  assumption  that 
the  fundamental  driving  force  for  electron  trans¬ 
port  is  the  x-directed  electric  field.  The  HEMT 
Q2D  model  important  features  have  been  reported 
previously  [4].  In  contrast  to  MESFETs,  spurious 
substrate  current  occurs  in  HEMT  buffer  layers 
due  to  the  lateral  Ex  field  component.  This  current 
is  drawn  around  the  depleted  channel  and  reduces 
the  magnitude  of  the  field.  It  is  assumed  in  this 
paper  that,  as  a  result  of  the  reduction  in  the 
electric  field,  an  increase  in  the  breakdown  voltage 
is  possible.  Figure  1  shows  a  schematic  diagram  of 
a  HEMT  device. 

The  computational  interpretation  of  the  break¬ 
down  model  presented  in  this  paper  is  based  on  the 
interactions  between  the  avalanche  and  gate 
leakage  mechanisms.  The  relation  that  both 
processes  simultaneously  have  with  the  device 
design  parameters  and  power  dissipation  inside 
the  device  is  included.  The  flow  chart  of  the  model 
is  shown  in  Figure  2.  The  gate  leakage,  conven¬ 
tionally  termed  ‘soft  breakdown’,  is  assumed  to 
always  occur  in  devices  prior  to  avalanche.  The 
flow  of  electrons  from  the  gate  into  the  semicon¬ 
ductor  would  then  influence  the  impact  ionisation 
process.  Adding  the  leaked  electrons  to  the 
channel  electrons  constituting  the  increasing  drain 
current  imposes  this  effect.  The  leakage  is  allowed 
in  the  three  pinchoff  stages. 


FIGURE  1  Schematic  diagram  of  a  HEMT  device  showing 
electrons  leaking  form  gate. 


FIGURE  2  Flow  chart  of  the  gate  breakdown  model. 


3.  THERMALLY  DRIVEN  GATE  LEAKAGE 
CURRENT 

The  gate  leakage  model  is  principally  based  on  a 
combination  of  Padovani  [5]  and  Rideout  [6] 
equations.  These  equations  involve  complex  func¬ 
tions  of  temperature,  barrier  height  and  semicon¬ 
ductor  parameters.  The  form  that  these  equations 
are  presented  in  the  above  references  is  quite 
complicated.  In  order  to  maintain  the  numerical 
efficiency  of  the  physical  device  simulator,  new 
approximations  are  introduced  in  this  paper, 
which  simplify  the  tunneling  functions  and  main¬ 
tain  the  accuracy  of  the  solution.  Tunneling 
through  the  gate  metal-semiconductor  barrier 
becomes  significant  in  the  reverse  direction  than 
in  the  forward  direction  because  the  bias  voltages 
involved  are  usually  greater.  This  cause  the 
potential  barrier  to  become  thin  enough  such  that 
tunneling  is  dominant.  The  Thermionic  (T), 
Thermionic-Field  (T-F)  and  Field  Emission  (FE) 
gate  leakage  currents  are  dependent  on  the  lattice 
temperature  and  material  specifications.  What 
determines  the  current  mechanism  is  the  tempera¬ 
ture  of  the  channel  and  the  applied  bias.  A  new 
thermal  model  [7]  incorporated  into  the  Q2D 
physical  model  computes  the  instantaneous  tem¬ 
perature  of  the  active  channel.  The  output  of  this 
model  is  linked  with  the  breakdown  model.  This 
assists  the  dynamic  update  of  the  gate  leakage 
current  mechanism. 
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3.1.  Thermionic-field  Tunneling  Current 

This  mechanism  is  the  major  form  of  tunneling. 
The  tunneling  current  is  defined  by  the  equation 

IT-F  =  AJS  exp(ii/ e')  (1) 


It  was  observed  during  the  course  of  this  work 
that  in  the  saturation  current  equation  (4),  the 
square  root  term  was  dominated  by  the  first  term. 
Hence  a  simpler  approximate  numerical  expres¬ 
sion  for  the  saturation  current  was  deduced: 


Where  A  is  the  gate  area,  E  is  applied  energy 
calculated  from  qVn  Vr  being  the  applied  reverse 
bias  voltage  given  a  positive  sign  throughout  this 
work,  q  is  the  electron  charge  and  ef  is  an  energy 
term  defined  as: 


(2) 


The  term  E00  quantifies  the  diffusion  potential 
from  metal  into  semiconductor.  It  has  two  equi¬ 
valent  definitions  given  in  [5]  and  [6].  The  latter 
definition  is  adopted  in  this  work,  however,  slight 
numeric  alteration  is  needed  for  consistency  of 
units: 


(3) 


N  is  the  doping  density  per  m  3.  In  order  to  use 
equation  (3)  in  (2),  Eao  must  be  converted  back  by 
multiplying  by  q,  the  electron  charge.  The  type  of 
HEMT  used  connects  the  gate  metalisation 
directly  to  the  TV-doped  AlGaAs  layer. 

In  equation  (1),  Js  is  defined  as  the  saturation 
current  given  in  [5]  as: 


Js  = 


RjjrEoo) 

KT 
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q<pb 


cosh2 


x  exp 


(zi M 

\E0  ) 


KT 


(4) 


where  (j>b  is  the  Schottky  barrier  height.  Ea  is  a 
term  defined  as: 


E0  =  E00  coth 


(5) 


Js  =  ^[tt EooqVr}'12  exp  (6) 

which  is  integrated  into  the  model.  An  empirical 
difference  limit  between  equations  (4)  and  (6)  was 
reached  after  some  experimentation’s  beyond 
which  if  this  limit  is  exceeded,  the  solution 
obtained  from  equation  (6)  was  observed  to  affect 
the  numerical  accuracy  of  the  tunneling  current. 
The  model  then  switches  to  the  more  stringent 
expression  of  equation  (4).  The  thermionic  and 
field  emission  currents  are  calculated  using  similar 
equations.  Threshold  equations  are  incorporated 
which,  according  to  the  thermal  status  of  the 
device,  the  appropriate  leakage  mechanisms  is 
invoked. 


4.  SIMULATION  RESULTS 

Figure  3  shows  the  effect  of  varying  the  tem¬ 
perature  on  the  tunnelling  currents  at  various 


FIGURE  3  Tunneling  current  versus  channel  temperature  at 
various  doping  concentrations. 
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doping  levels.  An  increase  in  the  T-F  current  is 
observed  with  an  increase  in  the  doping 
concentration.  Figure  4  compares  between  the 
thermionic-field  and  field  emission  currents  with 
respect  to  temperature.  The  T-F  current  is  clearly 
of  more  prominent  effect.  The  simulation  of  the 
DC  I-V  characteristics  of  a  HEMT  device  enabling 
gate  current  tunnelling  mechanism  is  shown  in 
Figure  5.  The  soft  breakdown  mechanism  is 
influenced  by  the  design  parameters  and  applied 


FIGURE  4  Comparison  between  the  thermionic-field  and 
field-emission  leakage  currents. 


DC  Characterstics  of  a  HEMT 


FIGURE  5  I-V  characteristics  of  a  HEMT  device  incorporat¬ 
ing  the  gate  model. 


bias.  It  can  be  practically  and  numerically 
observed  when  the  stimulating  conditions  are 
met.  Comparisons  with  measured  data  in  [8]  shows 
good  agreement  between  simulated  and  measured 
results  of  a  MESFET  case. 

5.  CONCLUSION 

The  Q2D  physical  device  simulation  of  HEMTs  has 
been  expanded  to  include  soft  thermal  breakdown 
effects  using  tunnelling  current  mechanisms.  The 
influence  of  soft  breakdown  mechanism  on  the  DC 
characteristics  is  influenced  by  the  design  para¬ 
meters  and  applied  bias.  The  thermal  effects  in  the 
active  channel  under  the  gate  region  are  included. 
This  is  conducted  using  an  incorporated  thermal 
model,  which  calculates  the  channel  temperature 
and  updates  the  thermal  breakdown  model. 


References 

[1]  Frensley,  W.  R.  (1981).  “Power-limiting  breakdown 
effects  in  MESFETs,  ”  IEEE  Trans .,  ED-28(8),  962-970. 

[2]  Morton,  C.  G.,  Atherton,  J.  S.,  Snowden,  C.  M.,  Pollard, 
R.  D.  and  Howes,  M.  J.  “A  large-signal  physical  HEMT 
model  ,”  Int.  Microwave  Symp.  MTT-S,  3,  1759-1762, 
San  Francisco. 

[3]  Snowden,  C.  M.  and  Pantoja,  R.  R.  (1989).  “Quasi-two- 
dimensional  MESFET  simulation  for  CAD”,  IEEE 
Trans .,  ED-36(9),  1564-1574. 

[4]  Morton,  C.  G.  and  Snowden,  C.  M.  “HEMT  physical 
model  for  MMMIC  CAD”,  Proceedings  of  25th  European 
Microwave  Conference ,  Sept.  1995. 

[5]  Padovani,  F.  A.  and  Stratton,  R.  (1966).  “Field  and 
thermionic-field  emission  in  schottky  barriers”,  Solid- 
State  Electronics ,  9,  695-707. 

[6]  Crowell,  C.  R.  and  Ridout,  V.  L.  (1969).  “Normalised 
thermionic-field  (T-F)  emission  in  metal-semiconductor 
(schottky)  barriers”,  Solid-State  Electronics ,  12,  89-105. 

[7]  Johnson,  R.  G.,  Snowden,  C.  M.  and  Pollard,  R.  D. 
(1997).  “A  Physics  Based  Electro-Thermal  Model  For 
Microwave  And  Millimetre  Wave  HEMTs”,  International 
microwave  symposium ,  MTT-S,  Denver,  USA. 

[8]  Albasha,  L.,  Snowden,  C.  M.  and  Pollard,  R.  D.  “Break¬ 
down  Characterization  of  HEMTs  and  MESFETs  Based 
on  A  New  Thermally  Driven  Gate  Model”,  International 
Conference  on  Simulation  of  Semiconductors  processes  and 
Devices  SISPAD ,  8-10  Sep.  1997,  Cambridge,  MA,  USA 
(To  be  published). 


Authors’  Biographies 

Lutfi  Albasha  received  his  B.Eng.  (Hons.)  degree  in 
Electronic  and  Electrical  Engineering  from  the 


A  NEW  HEMT  BREAKDOWN  MODEL 


353 


University  of  Leeds  in  1990.  In  1991  he  obtained 
an  M.Sc.  (with  distinction)  in  RF  communica¬ 
tions.  In  1992,  He  joined  the  Microwave  and 
Terahertz  Technology  Group  at  the  University  of 
Leeds  and  obtained  his  Ph.D.  in  1995.  His  thesis 
was  on  the  electromagnetic  modelling  of  micro- 
wave  circuits  and  its  feasibility  as  a  CAD  tool.  In 

1994  he  was  employed  by  Filtronics  Components 
Ltd.,  UK.  Currently  he  is  a  research  fellow  at  the 
University  of  Leeds  conducting  new  studies  on  the 
high  frequency  breakdown  modelling  in  HEMTs 
and  its  measurement  techniques  at  W-band 
frequencies.  His  research  interests  include  electro¬ 
magnetic  numerical  modelling  and  the  design  and 
measurements  of  microwave  circuits  and  devices. 

Christopher  M.  Snowden  received  the  B.Sc., 
M.Sc.  and  Ph.D.  degrees  from  the  University  of 
Leeds.  After  graduating  in  1977  he  worked  as  an 
Applications  Engineer  for  Mullard,  Mitcham.  His 
Ph.D.  studies  were  conducted  in  association  with 
Racal-MESL  and  were  concerned  with  the  large- 
signal  characterisation  of  MESFET  microwave 
oscillators.  In  1982  he  was  appointed  Lecturer  in 
the  Department  of  Electronics  at  the  University  of 
York.  He  joined  the  Microwave  Solid  State  Group 
in  the  Department  of  Electrical  and  Electronic  at 
the  University  of  Leeds  in  1983.  He  now  holds  the 
Chair  of  Microwave  Engineering  in  the  Micro- 
wave  and  Terahertz  Technology  Research  Group 
and  is  also  currently  Head  of  the  Department  of 
Electronic  and  Electrical  Engineering.  During 
1987  he  was  a  Visiting  Research  Associate  at  the 
California  Institute  of  Technology.  He  has  been  a 
Consultant  to  M/A-COM  Inc.,  Corporate  Re¬ 
search  and  Development  since  1989,  where  he  was 
on  sabbatical  leave  during  the  period  1990-91. 
During  this  year  he  represented  M/A-COM  as 
Senior  Staff  Scientist.  He  was  Chairman  of  the 

1995  international  Microwaves  and  RF  Confer¬ 
ence.  He  is  a  Member  of  the  MIT  Electromag¬ 
netics  Academy.  He  is  also  a  Top  Scientist  at  the 
International  Research  Centre  for  Telecommuni¬ 


cations-Transmission  and  Radar,  Delft  University 
of  Technology,  Netherlands.  Professor  Snowden  is 
a  Fellow  of  the  IEEE  and  a  Fellow  of  the  IEE.  He 
is  a  Distinguished  Lecturer  (1996/7)  for  the  IEEE 
(Electron  Devices  Society).  He  is  co-Chairman  of 
the  MTT-1  Committee  and  a  Member  of  the  1997 
IEEE  MTT-S  Technical  Program  Committee.  His 
main  research  interests  include  compound  semi¬ 
conductor  device  modelling,  microwave,  terahertz 
and  optical  nonlinear  subsystem  design  and 
advanced  semiconductor  devices.  He  has  written 
7  books  and  over  190  papers. 

Roger  D.  Pollard  was  born  in  London,  England 
in  1946.  He  received  his  technical  education, 
graduating  with  the  degrees  of  BSc  and  PhD  in 
Electrical  and  Electronic  Engineering,  at  the 
University  of  Leeds,  Leeds,  UK. 

He  holds  the  Hewlett-Packard  Chair  in  High 
Frequency  Measurements  in  the  School  of  Elec¬ 
tronic  and  Electrical  Engineering  at  the  University 
of  Leeds  where  he  has  been  a  faculty  member  since 
1974.  He  is  Deputy  Director  of  the  Institute  of 
Microwaves  and  Photonics  which  has  over  40 
active  researchers,  a  strong  graduate  program  and 
has  made  contributions  to  microwave  passive  and 
active  device  research.  The  activity  has  significant 
industrial  collaboration  as  well  as  a  presence  in 
continuing  eduction  through  its  Microwave  Sum¬ 
mer  School.  Professor  Pollard’s  personal  research 
interests  are  in  microwave  network  measurements, 
calibration  and  error  correction,  microwave  and 
millimetre-wave  circuits  and  large-signal  and  non¬ 
linear  characterization.  He  has  been  a  consultant 
to  the  Hewlett-Packard  Company,  Santa  Rosa, 
CA  since  1981.  He  has  published  over  100 
technical  articles  and  hold  3  patents. 

Roger  Pollard  is  a  Chartered  Engineer,  a  member 
of  the  Institution  of  Electrical  Engineers  (UK)  and 
a  Fellow  of  the  IEEE.  He  is  1998  President  of  the 
IEEE  Microwave  Theory  and  Techniques  Society 
where  he  is  serving  his  second  term  as  an  elected 
member  Administrative  Committee. 


VLSI  DESIGN 

1998,  Vol.  8,  Nos.  (1-4),  pp.  355-360 
Reprints  available  directly  from  the  publisher 
Photocopying  permitted  by  license  only 


1998  OPA  (Overseas  Publishers  Association)  N.V. 

Published  by  license  under 
the  Gordon  and  Breach  Science 
Publishers  imprint. 
Printed  in  India. 


Rate  Equation  Modelling  of  Nonlinear 
Dynamics  in  Directly  Modulated  Multiple 
Quantum  Well  Laser  Diodes 

STEPHEN  BENNETT1 2  *,  CHRISTOPHER  M.  SNOWDEN b  and  STAVROS  IEZEKIEL b 

a  Department  of  Electronic  and  Electrical  Engineering,  University  College  London,  London,  WCIE  7JE,  UK; 
b  Department  of  Electronic  and  Electrical  Engineering,  University  of  Leeds,  Leeds,  LS2  9JT,  UK 


A  theoretical  (using  rate  equations)  and  experimental  study  of  the  nonlinear  dynamics 
of  a  distributed  feedback  multiple  quantum  well  laser  diode  is  presented.  The  analysis  is 
performed  under  direct  modulation.  Period  doubling  and  period  tripling  are  identified 
in  both  the  measurements  and  simulations.  Period  doubling  is  found  over  a  wide  range 
of  modulation  frequencies  in  the  laser.  Computational  results  using  rate  equations  show 
good  agreement  with  the  experimental  results. 


Keywords:  MQW  laser  diodes,  rate  equations,  nonlinear  dynamics,  period  doubling,  period 
tripling 


1.  INTRODUCTION 

Multiple  Quantum  Well  (MQW)  laser  diodes  are 
widely  used  in  many  applications  because  of  their 
superior  performance  characteristics  over  bulk 
laser  diodes.  These  include  higher  modulation 
bandwidths  and  lower  threshold  currents,  both  of 
which  are  desirable  qualities  for  high-speed 
optical  fibre  links.  Directly  modulated  laser 
diodes  are  operated  under  conditions  that  can 
lead  to  a  wide  variety  of  nonlinear  behaviour, 
including  period  doubling  [1]  and  chaos  [2].  It  is 


therefore  vital  to  have  an  accurate  model  of 
nonlinear  behaviour  that  is  numerically  efficient, 
and  for  the  most  part  rate  equations  are  used  by 
most  workers.  However,  in  contrast  to  a  large 
volume  of  work  using  rate  equations  for  non¬ 
linear  modelling  of  bulk  structures,  little  such 
work  exists  for  MQW  lasers.  This  paper  presents 
a  detailed  theoretical  and  experimental  study  of 
the  nonlinear  dynamics  of  an  InGaAsP/InGaAs 
MQW  A/4  shifted  Distributed  Feedback  (DFB) 
laser  diode  with  16  quantum  wells  [3]  under  direct 
modulation. 
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2.  MQW  LASER  DIODE  MODEL 


To  account  for  the  carrier  dynamics  in  quantum 
well  lasers,  a  rate  equation  model  based  on  that 
proposed  by  Nagarajan  et  al  [4]  is  used  in  our 
numerical  simulations.  The  rate  equations  for  the 
carrier  density  in  the  quantum  wells  ( N )  and  the 
barriers  (NB)  and  the  photon  density  in  the  optical 
cavity  ( S )  can  be  written  as, 

dNB  =  Tq/  Nb  TqN 

dt  eV  rc  re 


*£=  *L_  JJL  +  I 

dt  TqTc  _  Tn  Tq 


vg  G(N,S)S 


(2) 


TL-1|,  where  PRF  is  the  incident  RF  input  power, 
TL  is  the  reflection  coefficient  of  the  laser  and  Z0  is 
the  characteristic  impedance.  The  quantum  wells 
have  one  conduction  subband  so  that  the  relation¬ 
ship  between  the  carrier  density  and  the  junction 
voltage  Vj  can  be  approximated  [5], 


Nivkr-^-ln 

irh2Lz 


1  +  exp 


(eFv/ph)~ 


(6) 


where  mc  is  the  electron  effective  mass,  Lz  is  the 
quantum  well  width  and  Ep h  is  the  photon  energy. 
The  MQW  material  gain  G  (N,  S)  is  modelled  by  [6] 


G(N,  S )  = 


go  ln^ 

(1  +  eS)  Nq 


(7) 


S  +  T(3BN2  (3) 

where  Tq  is  the  fraction  of  the  MQW  region  filled 
by  the  quantum  wells,  Nb/tc  is  the  loss  rate  of 
carriers  from  the  barriers  to  the  quantum  wells  and 
N/re  is  the  loss  rate  of  carriers  from  the  quantum 
wells  to  the  barriers.  rp  is  the  photon  lifetime,  V  is 
the  active  region  volume,  T  is  the  mode  confine¬ 
ment  factor,  and  vg  is  the  group  velocity.  The 
dependence  of  the  carrier  lifetime  (rn)  on  N  is 
modelled  as, 

—  =  A  +  BN+CN2  (4) 

Tn 

where  the  terms  BN  and  CN2  model  bimolecular 
and  Auger  recombination  respectively.  The  output 
power  per  facet  P  is  related  to  S  via, 


dS 
dt  : 


rvgG(Ar,  5)  —  — 


P  = 
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where  R  is  the  facet  reflectivity  («  0.32)  and  L  is 
the  cavity  length.  The  injection  current  (/)  can  be 
expressed  /=/dc  + J Rpsin  (In ft),  where  /DC  is  the 
bias  current,  /RF  is  modulation  current  amplitude 
and  /  is  the  modulation  frequency.  Using  an 
isolator  between  the  microwave  source  and  the 
laser  to  minimise  impedance  mismatch  problems 
enables  /RF  to  be  approximated,  /RF  «  a/2Prf/Z0| 


where  e  is  the  nonlinear  gain  coefficient,  N0  is  the 
transparent  carrier  density  and  G0  is  a  constant 
dependant  on  well  structure. 

It  is  known  that  the  fraction  of  the  spontaneous 
emission  coupled  into  the  lasing  mode  (/?  factor) 
plays  a  large  part  in  determining  the  nonlinear 
dynamics  of  laser  diodes  [2,7-9]  and  it  has 
recently  been  found  that  bulk  DFB  laser  diodes 
can  exhibit  a  reduced  (3  factor  [2]  compared  to 
their  FP  counterparts  (8  x  10-7  compared  to  10~5 
in  typical  FP  lasers).  The  value  of  (3  used  in  our 
simulations  was  therefore  measured  using  the  same 
technique  described  in  [2],  Other  parameters  which 
were  not  material  or  dimensional  in  nature  were 


Frequency  (GHz) 

FIGURE  1  Measured  (solid  lines)  and  simulated  (points) 
small-signal  modulation  frequency  response  of  the  laser  with 
increasing  output  power. 
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extracted  from  the  small-signal  measurements 
shown  in  Figure  1.  The  parameter  values  used  in 
the  simulations  were:  V  —  5.1  x  10_17m3,r  =  0.22, 
r9  =  0.66,rp  =  1.3ps,/3  =  10~6,  G0  =  141107  m-1, 
N0  =  2.41xl024m"3,e  =  3.24  x  10-23  m3,  A  =  108 
s-1,  B  =  10-16  m3s-1,  C  =  3  xlO-41  m6s-1, 
rc  =  20  ps,  and  re  =  191  ps.  Simulations  were  per¬ 
formed  by  solving  (l)-(3)  using  a  fourth  order 
Runge-Kutta  scheme. 

3.  NONLINEAR  DYNAMIC  ANALYSIS 


3.1.  Results 

So  as  to  study  the  nonlinear  dynamics  resulting 
from  direct  modulation  of  the  laser  alone,  optical 
feedback  into  the  laser  was  minimised  by  the  use  of 
an  optical  isolator.  The  experimental  setup  used  to 
analyse  the  nonlinear  dynamics  of  the  lasers  is 
shown  in  Figure  2.  The  incident  RF  input  power  to 
the  laser  (PRF)  was  monitored  through  a  direc¬ 
tional  coupler  and  the  problem  of  laser  mismatch 
to  500  was  minimised  through  the  use  of  a 
microwave  isolator. 

For  a  range  of  output  power  levels  *  (P0ut)  and 
modulation  frequencies  (/),  PRF  was  swept 
between  -5  and  22  dBm  and  the  regions  where 
nonlinear  behaviour  occurred  recorded.  These 
regions  are  shown  in  Figure  3  along  with  the 
simulated  results.  There  were  both  regions  of 
period  doubling  and  period  tripling.  Period 
tripling  occurred  when  the  laser  was  biased  with 
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FIGURE  2  Experimental  setup  used  to  investigate  the  non¬ 
linear  dynamics  of  the  laser. 


Modulation  Frequency  (GHz) 


FIGURE  3  Regions  of  where  nonlinear  behaviour  were 
recorded  for  (a)  the  measured  laser,  (b)  the  simulated  laser. 
PD  denotes  the  regions  where  period  doubling  were  recorded 
and  T  the  region  where  period  tripling  was  observed. 


P out  levels  of  between  0.7  mW  and  3.6  mW.  In  the 
region  where  period  tripling  occurred,  as  PRF  was 
increased,  the  route  to  period  tripling  was  always 
via  period  doubling.  Figure  4  shows  the  measured 
and  simulated  output  frequency  spectrum  of  the 
laser  under  conditions  of  single  period,  period 
doubling  and  period  tripling  behaviour. 

There  were  features  of  the  period  doubling 
which  were  common  to  both  simulated  and 
measured  lasers.  Namely  the  upper  and  lower 
modulation  frequency  limits  of  the  period  dou¬ 
bling  regions  seemed  to  follow  one  and  two  times 


Corresponding  to  a  range  of  bias  currents. 
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the  relaxation  frequency  of  the  laser  (/R)  respec¬ 
tively.  For  a  fixed  ratio  of  modulation  frequency  to 
relaxation  frequency  the  value  of  PRF  required  to 
produce  period  doubling  increased  almost  linearly 
with  Pou t.  These  features  are  illustrated  in  Figure  5 
where  the  calculated  values  of  PRF  required  for 
period  doubling  to  occur  in  the  laser  are  displayed 
versus  Pout  and  the  modulation  frequency  normal¬ 
ised  to  /R.  The  measured  and  simulated  values  of 
Frf  required  for  period  doubling  to  occur  in  the 
laser  versus  Pout  are  displayed  in  Figure  6.  When 
obtaining  the  data  displayed  in  Figure  6  the 
modulation  frequency  was  adjusted  at  each  Pout  to 
keep  the  ratio  of  modulation  frequency  to  relaxa¬ 
tion  frequency  constant,  which  for  Figure  6  was 
1.5.  It  is  expected  that  this  relationship  is  due  to 
the  corresponding  increase  in  relaxation  damping 
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FIGURE  4  Measured  and  simulated  frequency  spectrum  of 
the  laser  under  conditions  of  single  period  (PI),  period  doubling 
(P2)  and  period  tripling  (P3)  behaviour. 


Frequency  f  /  fR 

FIGURE  5  Three  dimensional  plot  displaying  the  calculated 
values  of  PRF  required  for  period  doubling  to  occur  in  the  laser 
versus  Pout  and  the  modulation  frequency  normalised  to  /R. 


FIGURE  6  Measured  (points)  and  simulated  (solid  line)  RF 
input  power  required  to  produce  period  doubling  versus  output 
power  per  facet.  The  modulation  frequency  to  relaxation 
frequency  ratio  is  kept  constant  at  1.5. 
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FIGURE  7  Simulated  dependence  of  nonlinear  behaviour  on 
PRF  and  (3  in  the  laser.  Simulations  were  performed  with  a 
modulation  frequency  of  6  GHz  and  Pout  set  at  0.6  mW.  The 
black  areas  in  the  diagram  indicate  regions  of  higher  order 
bifurcations. 


with  power  level  which  is  known  to  be  an 
important  factor  in  characterising  the  nonlinear 
dynamics  of  laser  diodes  [9]. 

Liu  et  al.  [2]  suggest  that  the  promotion  of  period 
tripling  in  their  bulk  DFB  laser  was  due  to  the 
unusually  low  value  of  (3  measured  in  their  laser.  To 
investigate  whether  this  was  the  case  in  our  MQW 
DFB  laser  we  analysed  the  effect  of  variations  in  (3 
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on  our  simulation  results.  Figure  7  shows  the 
regions  of  nonlinear  behaviour  as  (5  and  PRF  are 
varied.  It  is  clear  that  as  (3  is  reduced  the  period 
tripling  behaviour  becomes  more  prominent.  It 
would  therefore  seem  likely  that  the  observed  period 
tripling  in  our  DFB  laser  can  be  attributed  to  the 
low  value  of  /3  measured  in  this  laser  compared  to 
that  measured  in  typical  Fabry  Perot  lasers. 

4.  CONCLUSION 

In  summary,  we  have  performed  a  detailed 
theoretical  (using  rate  equations)  and  experimental 
analysis  of  the  nonlinear  dynamics  of  a  MQW 
DFB  laser  diode  under  direct  modulation.  We 
observed  period  doubling  and  period  tripling  in 
both  measurements  and  simulations.  Period  dou¬ 
bling  was  evident  over  a  wide  range  of  modulation 
frequencies.  Computational  results  using  rate 
equations  show  good  agreement  with  the  experi¬ 
mental  results. 
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Temperature  Dependence  of  the  Electron  and  Hole 
Scattering  Mechanisms  in  Silicon  Analyzed  through  a 
Full-Band,  Spherical-Harmonics  Solution  of  the  BTE 

SUSANNA  REGGIANI,  MARIA  CRISTINA  VECCHI  and  MASSIMO  RUDAN* 
Dipartimento  di  Elettronica,  Informatica  e  Sistemistica,  Universita  di  Bologna,  Viale  Risorgimento  2,  40136  Bologna,  Italy 


By  adopting  the  solution  method  for  the  BTE  based  on  the  spherical-harmonics 
expansion  (SHE)  [1],  and  using  the  full-band  structure  for  both  the  electron  and  valence 
band  of  silicon  [2],  the  temperature  dependence  of  a  number  of  scattering  mechanisms 
has  been  modeled  and  implemented  into  the  code  HARM  performing  the  SHE  solution. 
Comparisons  with  the  experimental  mobility  data  show  agreement  over  a  wide  range  of 
temperatures.  The  analysis  points  out  a  number  of  factors  from  which  the  difficulties 
encountered  in  earlier  investigations  seemingly  originate,  particularly  in  the  case  of  hole 
mobility. 


Keywords :  Carrier  mobility,  scattering  mechanisms,  Boltzmann  equation,  spherical-harmonics 
expansion 


1  INTRODUCTION 

We  present  here  an  investigation  of  the  tempera¬ 
ture  dependence  of  the  scattering  mechanisms 
described  at  microscopic  level  through  the  solution 
of  the  BTE  based  on  an  expansion  in  spherical 
harmonics.  Such  investigation  is  also  relevant  from 
the  application  viewpoint  because  it  allows  for  a 
detailed  analysis  of  the  temperature  and  impurity 
dependence  of  mobility,  hence  of  the  performance 
of  the  submicron  and  power  semiconductor 
devices.  The  mechanisms  considered  here  are 


acoustical-  and  optical-phonon  scattering,  io- 
nized-impurity  scattering,  and  impact  ionization. 
In  particular,  phonon  and  impurity  scattering  have 
a  strong  influence  on  mobility  also  at  low  energies. 
In  addition,  the  effect  of  incomplete  ionization  of 
dopant  has  been  introduced  as  well,  to  account  for 
the  behavior  of  carrier  mobility  at  high  impurity 
concentration  and  low  temperatures. 

By  this  approach  the  experimental  mobility 
curves  of  electrons  and  holes  are  well  reproduced 
in  a  wider  range  of  temperatures  and  impurity 
concentrations  than  in  earlier  investigations. 
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2  PHYSICAL  MODELS 


The  mathematical  model  resulting  from  the  SHE 
of  the  BTE  is  briefly  summarized  here,  in  the 
homogeneous  case,  with  reference  to  both  the 
conduction  and  valence  band: 


2  n'>  ® 

“ f'se 


(1) 


+  3copg(E)Mop  -  3cji g2(E)fo(E) 

+  3 g(E)  I  A(E',E)f0(El)g(E')dE', 

where  Mop  =  (N+p/+-Nop/0)g+  -  (N+pf0  -  Nop 
/o)g“.  The  symbols  have  the  following  meaning: 
g(E)  is  the  density  of  states,  ug(E)  the  modulus  of 
the  group  velocity,  r  the  total  scattering  rate,  F 
the  electric  field,  cop  a  constant  proportional  to  the 
optical-phonon  coupling  constant,  Nop  the  optical- 
phonon  occupation  number,  N+p  —  7VoP  +  1, 
g±(E)  =g(E±Hujl 0p),  where  tiuoop  is  the  optical- 
phonon  energy,  and  similarly  for  fo±(E).  Impact 
ionization  is  also  considered:  cug(E)  is  the  total 
impact-ionization  scattering  rate  and  A(Ef,E)  is  a 
suitable  kernel  [3].  All  the  scattering  mechanisms 
considered  here  exhibit  a  temperature  dependence: 

•  acoustical-phonon  scattering  shows  an  explicit 
dependence  on  temperature  [4]; 

•  optical-phonon  scattering  depend  on  tempera¬ 
ture  through  the  occupation  number  Nop  [4]; 

•  ionized-impurity  scattering  depend  on  tempera¬ 
ture  through  the  partial  ionization  effect; 

•  impact  ionization  depend  on  temperature 
through  the  energy  gap. 
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where  N{  is  the  ionized-impurity  concentration,  /  is 
the  overlap  factor,  Ze  the  impurity  charge,  es  the 
semiconductor  dielectric  constant,  k  the  wave 
vector  magnitude,  f  the  scattering  angle,  (3  the 
inverse  screening  length. 

The  overlap  factor  accounts  for  the  interaction 
between  the  initial  and  final  states  in  a  collision 
event  and  depends  on  the  shape  of  the  band. 
According  to  [4],  it  is  considered  constant  for  the 
conduction  band,  incorporating  its  value  in  the 
coupling  constant  of  each  of  the  scattering 
mechanisms.  For  the  valence  band  the  expression 
obtained  by  Wiley  [4]  has  been  adopted: 

I2{k,k')=\(  1  +  3cos2£), 

which  produces  an  additional  angular  dependence 
in  the  spherical  harmonics  expansion  of  the 
impurity  scattering  term,  while,  for  the  collisions 
with  phonons  the  expansion  of  the  scattering  terms 
turns  out  to  be  unmodified,  apart  from  a  constant 
factor. 

The  inverse  screening  length  is  usually  given  by 
the  Debye  expression  which  is  obtained  by  solving 
Poisson’s  equation  considering  an  isotropic  para¬ 
bolic  band  and  Boltzmann  statistics.  In  this  work 
we  consider  a  more  complicated  description  of  the 
band  structure,  based  on  pseudo-potential  calcula¬ 
tion,  and  therefore  we  compute  the  inverse  screen¬ 
ing  length  following  the  general  definition  [5]: 

P2  =  I g(E)fo(E)(l  -f0(E))dE. 


In  the  following  we  will  focus  on  the  effect  of 
temperature  on  the  carrier  mobility  as  a  function 
of  impurity  concentration,  therefore  we  will 
describe  with  more  details  the  scattering  term 
strongly  related  to  this  kind  of  collision. 

The  ionized-impurity  scattering  is  described 
according  to  the  Brooks-Herring  (BH)  model: 


where  e  electron  charge,  Kq  Boltzmann  constant, 
T0  lattice  temperature,  g(E)  density  of  states, 
fo(E)  first  term  of  SHE  of  distribution  function.  It 
is  necessary  to  include  additional  corrections  to  the 
Brooks-Herring  (2)  scattering  term  to  take  into 
account  the  effects  of  the  Born  approximation  of 
order  higher  than  1  for  incoherent  collision  with  a 
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single  impurity  and  for  coherent  collisions  with 
impurity-pairs.  The  theory  of  multiple  collisions  of 
[6],  in  agreement  with  the  Brooks-Herring  for¬ 
mulation,  considers  an  electron  gas  interacting 
with  N[  impurities  in  the  crystal.  The  main 
difference  between  the  results  of  [6]  and  that  of 
Brooks-Herring  is  that  the  latter  considers  the 
potential  of  a  single  impurity,  resulting  in  a  total 
collision  probability  equal  to  N{  times  the  collision 
probability  of  a  single  impurity.  The  result  of  [6], 
instead,  keeps  the  information  on  the  interaction 
potential  between  electrons  and  impurities,  which 
is  expressed  by  J^/li  K(r  —  Ry),  with  Ry  position 
vector  for  the  y-th  ion,  and  modifies  the  scattering 
matrix  in  such  a  way  that  it  becomes: 

S(E,  E ')  =  Sm(E,  E')[l  +  6m  +  fo] 

where  SM,  <5D  are  multiple  collisions  and  impurity 
dressing  constants. 

The  effect  of  impurity  clustering,  relevant  for 
high  doping  concentrations,  has  also  been  taken 
into  account.  At  high  doping  densities  it  is 
described  as  if  the  carrier  scatters  with  a  cluster 
of  Z  ions,  with  Z  number  of  charge.  The  partial 
ionization,  relevant  for  both  high  doping  concen¬ 
tration  and  low  temperature,  has  been  modeled 
redefining  the  donor  and  acceptor  concentrations 
according  to  the  following  expressions: 

N+  - 

D  1 +2exp[0.045/(^Br)] 

for  electrons;  and 

N.= _ Na _ 

A  1  +4cxP[0.0438/(A:b7’)] 

for  holes,  along  with  the  description  of  the 
equilibrium  carrier  concentration  in  terms  of 
Fermi  statistics. 

3  RESULTS 

The  model  described  in  the  previous  section  has 
been  applied  to  the  investigation  of  electron  and 


hole  mobility  as  a  function  of  impurity  concentra¬ 
tion  and  temperature. 

The  experiments  show  that  the  dependence  of 
the  electron  mobility  on  temperature  in  intrinsic 
silicon  differs  from  that  of  holes.  In  particular,  at 
low  temperatures  (T  <  100  K)  it  is  (in  «  T~ 1 5,  /ip 
«  T~2.  On  the  other  hand,  at  higher  temperatures 
(up  to  500  K)  several  investigators  agree  on  that 
the  mobility  of  both  species  of  carriers  is  well 
described  by  a  relation  of  the  form  /xn  p  =  jimax(T/ 
300)  ~7,  with  2.2  <  7  <  2.3.  The  relative  departure 
of  the  mobility  laws  in  terms  of  the  lattice 
temperature  was  initially  ascribed  to  the  different 
structure  of  the  conduction  and  valence  bands. 
However,  after  implementing  the  full-band  struc¬ 
ture  in  HARM,  a  quantitative  agreement  was 
found  in  a  wide  range  of  temperatures  for  the 
electron  mobility  only;  on  the  contrary,  the 
agreement  of  the  hole  mobility  was  acceptable  at 
300  K,  whereas  a  difference  of  about  20%  was 
found  at  200  and  400  K.  This  behavior  can  be 
observed  in  Figures  1  and  2,  where  the  intrinsic 


Temperature  (K) 

FIGURE  1  Experimental  intrinsic  electron  mobility  as  a 
function  of  temperature  compared  with  the  results  obtained 
with  the  HARM  code. 
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FIGURE  2  Experimental  intrinsic  hole  mobility  as  a  function 
of  temperature  compared  with  the  results  obtained  with  the 
HARM  code.  For  the  latter  two  different  results  are  provided: 
the  one  labeled  as  “HARM  (without  modification)”  refers  to  an 
acoustical-phonon  scattering  model  with  no  dependence  of  the 
coupling  constants  on  temperature;  this  dependence  has  been 
added  in  the  results  labeled  as  “HARM  (with  modification)”. 


phonon  coupling  constants  were  preliminarily 
fitted  on  the  experimental  data  at  300  K.  Then,  a 
model  for  the  temperature  dependence  of  the 


FIGURE  3  Experimental  electron  mobility  as  a  function  of 
impurity  concentration  at  different  temperatures  compared  with 
the  results  obtained  with  the  HARM  code. 


mobility  for  electrons  and  holes  as  a  function  of 
temperature  is  reported.  The  curve  for  holes  which 
corresponds  to  the  physical  model  illustrated  so 
far  is  labeled  “HARM  (without  modification)”. 

Another  effect  whose  temperature  dependence 
proved  relevant  is  the  scattering  with  ionized 
impurities.  To  correctly  describe  such  mechanism 
it  is  necessary  to  take  the  incomplete  ionization 
into  account,  which  becomes  relevant  when  the 
total  impurity  concentration  exceeds  1017  cm”3 
and  produces  a  significant  lowering  in  the  major¬ 
ity-carrier  concentration  with  respect  to  that  of  the 
impurities.  As  already  mentioned,  incomplete 
ionization  has  been  implemented  into  HARM, 
along  with  the  description  of  the  equilibrium 
carrier  concentration  in  terms  of  Fermi  statistics. 

A  final  investigation  is  related  to  the  phonon 
scattering,  whose  effect  is  modeled  here  following 
[4].  The  values  of  the  acoustical-  and  optical- 


FIGURE  4  Experimental  hole  mobility  as  a  function  of 
impurity  concentration  at  different  temperatures  compared  with 
the  results  obtained  with  the  HARM  code. 
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acoustical-phonon  coupling  constant  has  been 
implemented  into  HARM  and  the  values  of  its 
parameters  have  been  derived  from  the  experi¬ 
ments.  The  result  obtained  with  this  additional 
dependence  on  temperature  of  the  acoustical- 
phonon  coupling  constant  is  indicated  in  Figure 
2  with  the  label  “HARM  (with  modification)”. 

A  number  of  runs  have  been  carried  out  to  test 
the  model  described  above  in  a  typical  range  of 
temperatures  as  a  function  of  impurity  concentra¬ 
tion.  The  comparison  with  experimentally-fitted 
curves  of  electron  and  hole  mobility  reported  by 
different  investigators  is  shown  in  Figures  3  and  4: 
in  both  cases  a  satisfactory  agreement  with 
experimental  data  has  been  obtained. 
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An  ensemble  Monte  Carlo  code  has  been  developed  for  the  simulation  of  hole  relaxation 
processes  in  a  GaAs/AlAs  quantum  well.  The  code  includes  a  realistic  k.p  model  of  the 
valence  subbands  and  corresponding  wavefunctions.  Intra-  and  inter-subband  phonon 
scattering  rates  are  calculated  for  polar  and  non-polar  interactions  via  both  optical  and 
acoustic  modes.  The  code  is  used  to  simulate  the  cooling  of  non-equilibrium 
photogenerated  hole  populations.  A  lifetime  of  90  fs  is  extracted  for  optical  phonon 
mediated  depopulation  of  the  4th  subband  at  77  K.  By  contrast,  the  2nd  subband 
exhibits  fast  re-population,  but  slow  de-population,  with  extracted  lifetimes  of  up  to 
160ps.  The  slow  depopulation  is  attributed  to  the  small  energy  separation  of  the  1st  and 
2nd  subbands  (less  than  the  optical  phonon  energy)  and  the  large  density  of  states  in  the 
2nd  subband  off-zone-centre  minimum. 


Keywords:  Monte  Carlo,  intersubband,  quantum  well 


1.  INTRODUCTION 

Interest  in  intersubband  transitions  has  been  given 
new  impetus  by  recent  developments  in  “Quantum 
Cascade”  or  intersubband  lasers  for  Far-Infra- 
Red  (FIR)  operation  [1].  Whilst  the  Quantum 
Cascade  devices  depend  on  electron  intersubband 
transitions,  p- type  quantum  well  systems  offer 
several  advantages  -  high  density  of  states,  wide 
tunability  and  the  possibility  of  normal  incidence 
operation  -  and  FIR  emission  has  already  been 
demonstrated  in  p- type  germanium  devices  [2], 


Design  of  FIR  emitting  devices  requires  detailed 
knowledge  of  the  electronic  bandstructure  and 
transient  carrier  dynamics.  The  Monte  Carlo 
method  is  an  ideal  simulation  tool  for  these 
structures,  since  the  time-dependent  non-equili¬ 
brium  carrier  distribution  function  is  intrinsically 
determined  without  recourse  to  any  approxima¬ 
tion,  enabling  accurate  determination  of  intersub¬ 
band  lifetimes. 

In  this  paper,  an  ensemble  Monte  Carlo  code  is 
used  to  simulate  transient  hole  relaxation  and 
energy  loss  phenomena  in  a  GaAs/AlAs  quantum 
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well.  The  quantum  confined  valence  bandstructure 
is  calculated  using  an  8-band  k.p  scheme,  to 
account  for  heavy-light  hole  mixing  effects.  The 
k.p  wavefunctions  are  used  to  calculate  hole- 
phonon  scattering  rates  for  the  acoustic  deforma¬ 
tion  potential,  polar-optical,  non-polar  optical  and 
piezoelectric  interactions,  via  Fermi’s  Golden 
Rule.  A  total  of  192  distinct  intra-  and  inter¬ 
subband  scattering  processes  are  included  in  the 
simulation.  Degeneracy  (Pauli  exclusion)  effects 
are  also  included.  The  k-space  trajectories  of 
typically  100,000  particles  are  simulated,  and 
cooling  from  a  range  of  non-equilibrium  carrier 
distributions  is  investigated. 

2.  BANDSTRUCTURE  AND  SCATTERING 
RATES 

Figure  1  shows  the  in-plane  hole  energy  dispersion 
for  the  lowest  four  subbands  of  a  100  A  GaAs 
quantum  well.  The  bandstructure  is  calculated  in 
the  axial  approximation,  in  which  k-space  aniso¬ 
tropy  is  not  resolved  [3].  In  practice,  anisotropy  in 
quantum  wells  is  not  as  significant  as  that  in  the 
bulk  GaAs  valence  bands.  Figure  1  clearly  shows 
the  familiar  distortion  of  the  dispersion  curves  due 
to  heavy-light  hole  mixing,  including  the  presence 
of  negative  effective  mass  regions  in  the  second 
and  fourth  subbands,  and  corresponding  off-zone- 
centre  minima.  These  minima  give  rise  to  Van- 
Hove  singularities  in  the  density  of  states,  which 
result  in  strongly  localised  peaks  in  the  scattering 
rates.  When  calculating  the  rates,  we  include  a 
Lorentzian  broadening  function  to  model  lifetime 
broadening  effects  which  will  ‘smear  out’  these 
singularities  [4].  The  k.p  calculation  yields  pairs  of 
degnenerate  eigenstates  which  have  mixed  spin 
parities.  In  this  simulation,  the  spin  states  of 
individual  carriers  are  not  recorded,  and  the  total 
rate  for  a  given  transition  is  obtained  by  summing 
the  scattering  rates  into  the  two  spin-degenerate 
states.  The  phonon  scattering  rates  were  calculated 
via  Fermi’s  Golden  Rule,  assuming  bulk-like 
phonon  modes  throughout.  Figures  2  and  3  show 


FIGURE  1  Energy  vs.  in-plane  wavevector  for  the  first  four 
valence  subbands  in  a  100  A  GaAs/AIAs  quantum  well. 


FIGURE  2  Phonon  scattering  rates  for  a  hole  in  the  2nd 
subband,  as  a  function  of  in-plane  wavevector,  for  a  lattice  tem¬ 
perature  of  Tl  =  77  K,  and  a  total  hole  density  of p  —  10!  1  cm-2. 
The  graph  shows  contributions  due  to  scattering  via  polar 
optical  (POP),  non-polar  optical  (NPO),  acoustic  deformation 
potential  (AC)  and  piezoelectric  (PZ)  processes. 

the  total  scattering  rates  for  a  hole  resident  in  the 
2nd  and  4th  subband,  respectively,  for  a  lattice 
temperature  of  77  K.  In  both  cases,  the  dominant 
scattering  processes  are  those  due  to  polar  optical 
phonons  (Frolich  interaction).  However,  for  a 
carrier  in  the  2nd  subband,  optical  phonon 
emission  is  only  possible  above  a  certain  energy 
(wavevector)  threshold,  whereas  in  the  4th  sub- 
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FIGURE  3  Phonon  scattering  rates  for  a  hole  in  the  4th 
subband:  details  as  for  Figure  2. 

band,  the  4-1  and  4-2  optical  phonon  emission 
processes  are  allowed  at  zero  wavevector.  Note 
that,  in  the  valence  bands,  the  deformation 
potential  (non-polar)  optical  phonon  interaction 
is  not  forbidden  (as  it  is  the  conduction  bands  at 
the  T  point),  although  the  scattering  rates  are 
smaller  than  those  for  the  polar  interaction. 
Acoustic  phonon  scattering  (modelled  here  by  a 
single  rate  to  represent  the  combined  effects  of  the 
LA  and  TA  modes)  is  only  important  at  low 
carrier  energies,  and  the  piezoelectric  interaction  is 
weaker  still,  even  at  77  K.  However,  in  our  zero- 
field  carrier  cooling  simulations,  it  is  essential  to 
include  the  acoustic  phonon  interactions,  since 
they  represent  the  principal  energy  relaxation 
processes  for  low  energy  carriers. 

3.  HOLE  RELAXATION  SIMULATIONS 

The  Monte  Carlo  model  includes  the  four  lowest 
energy  subbands  in  the  valence  band  quantum  well 
of  GaAs/AlAs.  The  subband  energy  dispersions, 
hole  group  velocities,  and  densities  of  states  were 
tabulated  as  functions  of  in-plane  wavevector. 
Similarly,  the  matrix  elements  for  all  permutations 
of  intra-  and  inter-subband  scattering  via  the 
relevant  phonon  modes  were  precalculated  and 


supplied  to  the  simulation  as  data.  For  the  polar 
optical  and  piezoelectric  processes,  the  matrix 
elements  vary  with  the  in-plane  phonon  wavevec¬ 
tor  (<7||),  and  hence  with  scattering  angle.  For  polar 
optical  scattering,  we  approximated  this  depen¬ 
dence  by  a  l/#y  variation  for  all  cases.  Similar 
analytic  functions  were  derived  to  describe  the 
angular  dependence  of  piezoelectric  scattering  [5]. 
When  calculating  the  new  carrier  states  after  an 
acoustic  or  piezoeletric  phonon  scattering  event, 
the  phonon  energy  EAC  was  calculated  using  a 
linear  phonon  dispersion  curve.  For  piezoelectric 
processes,  a  self  consistent  scheme  was  implemen¬ 
ted  to  ensure  that  the  choice  of  q\\  and  the 
corresponding  EAq  satisfied  wavevector  and  en¬ 
ergy  conservation  respectively,  as  well  as  following 
the  appropriate  probability  distribution  for  q y. 
The  optical  phonon  energy  was  assumed  to  be 
independent  of  wavevector. 

The  well  was  taken  to  be  nominally  undoped:  no 
impurity  scattering  was  included  in  the  simulation. 
The  results  shown  here  are  for  simulations  of  an 
ensemble  of  100,000  particles,  whose  energies,  in¬ 
plane  wavevectors,  and  subband  indices  are 
monitored  at  each  timestep.  Degeneracy  (Pauli 
exclusion)  was  included  by  compiling  the  hole 
distribution  function  in  each  subband  at  each 
timestep,  and  accounting  for  band-filling  via  a 
rejection  technique.  Since  the  initial  energy  dis¬ 
tributions  were  highly  non-equilibrium,  a  short 
timestep  (1  fs)  was  chosen  to  ensure  that  band¬ 
filling  was  not  underestimated  due  to  undersam¬ 
pling  of  the  distribution.  The  simulations  were 
carried  out  under  zero  field  conditions:  for  this 
special  case,  the  probability  function  which  de¬ 
scribes  the  distribution  of  free  flight  times  can  be 
inverted  directly,  without  recourse  to  the  self¬ 
scattering  technique.  This  greatly  increases  the 
efficiency  of  the  simulation. 

As  yet,  there  is  very  little  experimental  data  on 
hole  relaxation.  Therefore,  the  aim  of  this  initial 
work  was  to  obtain  information  on  the  hole 
subband  lifetimes  and  the  many-subband  cooling 
process,  rather  than  attempting  to  simulate  photo¬ 
excitation  experiments.  Such  a  direct  comparison 
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would  also  require  inclusion  of  carrier  -  carrier 
interactions,  which  are  not  yet  present  in  the 
simulation.  In  the  following  computer  experi¬ 
ments,  we  assume  that  a  highly  non-equilibrium 
hole  population  has  been  photogenerated  in  an 
upper  subband.  The  initial  energy  distribution  is 
highly  peaked  at  the  assumed  pump  energy,  with  a 
narrow  Gaussian  distribution  of  carrier  wavevec- 
tors  to  model  Pauli  exclusion  within  the  subband. 
Two  cases  are  presented  below:  firstly,  photoge¬ 
neration  in  the  4th  subband,  and  secondly,  into  the 
2nd  subband;  in  both  cases  the  pump  energy 
corresponds  to  the  subband  energy  minimum.  The 
generated  carrier  density  in  both  cases  was  10n 
cm"2. 

Figure  4  shows  the  time  dependence  of  the 
subband  populations  following  carrier  photoge¬ 
neration  in  subband  4.  As  expected,  subband  4 
depopulates  within  1  ps,  due  to  fast  polar  optical 
phonon  scattering  into  subbands  1  and  2.  Note 
that  very  few  carriers  are  scattered  into  subband  3, 
since  the  energy  separation  between  subbands  4 
and  3  is  less  than  the  optical  phonon  energy.  For 
t  <  1  ps,  subbands  1  and  2  are  populated  at 
approximately  the  same  rate;  however,  for 
t  >  1  ps,  carriers  begin  to  transfer  from  subband 
2  to  subband  1.  This  transfer  takes  over  lOOps  to 


FIGURE  4  Simulated  time  dependence  of  the  subband 
populations  (normalised  to  unity)  for  an  initial  population 
photogenerated  in  subband  4:  TL  =  77  K,  p  =  1011  cm-2. 


complete,  since  the  subband  edge  separation  here, 
too,  is  less  than  the  optical  phonon  energy,  and 
most  of  the  transitions  are  mediated  via  acoustic 
phonon  emission  (although  there  is  some  optical 
phonon  emission  from  carriers  located  higher  up 
the  2nd  subband).  One  of  the  striking  features 
about  the  population  dynamics  in  this  case  is  that 
the  2nd  subband  population  rises  to  approxi¬ 
mately  5  times  its  equilibrium  value:  in  other- 
words,  this  subband  acts  as  a  metastable  carrier 
reservoir,  which  populates  quickly,  but  depopu¬ 
lates  much  more  slowly.  This  behaviour  is,  in  part, 
due  to  the  relative  energy  separations  between 
subbands  4  and  2,  and  2  and  1,  but  also  due  to  the 
large  density  of  states  at  the  2nd  subband  (off- 
zone-centre)  energy  minimum. 

We  can  use  the  simulation  data  to  extract  a 
value  for  the  overall  phonon  scattering  lifetime. 
Obviously,  we  already  have  information  on  life¬ 
times  for  individual  phonon  scattering  processes, 
but  in  most  photoluminescence  experiments  these 
processes  cannot  be  separately  resolved,  and  the 
extracted  lifetime  represents  the  combined  effect  of 
all  participant  scattering  processs.  Furthermore, 
the  extracted  result  is  modified  by  the  exact  non¬ 
equilibrium  energy  distribution  of  the  carriers;  a 
quantity  which  cannot  readily  be  obtained  by 
analytic  methods,  but  which  is  inherently  included 
in  a  Monte  Carlo  simulation.  Not  surprisingly,  for 
such  a  complex  system,  we  find  that  the  lifetime 
approximation  itself  only  holds  within  limited  time 
ranges.  For  the  initial  cooling  phase  we  estimate 
an  overall  depopulation  lifetime  for  the  4th 
subband  of  approximately  r4  =  90  fs.  In  contrast, 
in  the  latter  phase  of  the  simulation  where  carrier 
transfer  from  the  2nd  to  1st  subband  occurs,  we 
estimate  a  an  overall  depopulation  lifetime  for  the 
2nd  subband  of  r2  =  90  ps. 

Figure  5  shows  the  energy  loss  rate  per  carrier 
for  the  same  simulation.  The  rate  has  also  been 
resolved  into  components  due  to  intra-  and  inter¬ 
subband  optical  and  acoustic  phonon  scattering, 
respectively.  As  expected,  the  initial  rapid  cooling 
is  primarily  due  to  optical  phonon  emission,  but  a 
crossover  occurs  after  approximately  2ps,  where- 
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FIGURE  5  Total  energy  loss  rate  per  hole,  and  energy  loss 
rates  due  to  intra-  and  inter  subband  optical  and  acoustic 
phonon  processes:  simulation  conditions  as  for  Figure  4. 


upon  acoustic  scattering  becomes  the  dominant 
energy  loss  mechanism. 

Figure  6  shows  the  time  dependence  of  the 
subband  populations  following  carrier  photoge¬ 
neration  in  subband  2.  In  this  case,  only  two 
subbands  participate  in  cooling,  and  we  see  a  slow 
transfer  of  carriers  from  subband  2  to  1.  In  the 
sub-picosecond  phase,  the  transfer  is  characterised 
by  a  lifetime  of  approximately  r2  =  4  ps,  whereas 
in  the  t>  10  ps  phase,  the  extracted  lifetime  is 
t2  =  160ps.  Figure  7  shows  the  energy  loss  rates 
for  this  simulation.  The  total  rate  is  actually 
negative  for  t  <  2ps.  Examination  of  the  simula¬ 
tion  statistics  reveal  substantial  (intrasubband) 
absorption  of  optical  phonons  by  the  2nd  subband 
carriers  in  the  initial  phase  of  the  simulation, 
followed  by  intersubband  optical  phonon  emission 
to  achieve  transfer  into  subband  1.  The  figure  also 
shows  some  initial  intersubband  transfer  by 
acoustic  phonon  emission,  but  this  process  is 
slower  than  the  optical  phonon  absorption,  and 
certainly  much  less  effective  in  exchanging  energy 
with  the  lattice.  It  is  this  intersubband  optical 
phonon  absorption  which  gives  rise  to  the  4ps 
depopulation  lifetime  -  substantially  longer  than 
the  lifetime  in  subband  4,  but  still  much  shorter 
than  typical  acoustic  phonon  scattering  lifetimes. 
For  /  >  2ps,  a  net  energy  loss  is  observed,  and  is 


FIGURE  6  Simulated  time  dependence  of  the  subband 
populations  for  an  initial  population  photogenerated  in 
subband  2:  TL  =  77  K,  p  =  10M  cnT2. 


FIGURE  7  Total  energy  loss  rate  per  hole,  and  energy  loss 
rates  due  to  intra-  and  inter  subband  optical  and  acoustic 
phonon  processes:  simulation  conditions  as  for  Figure  6. 

dominated  by  intrasubband  acoustic  phonon 
emission,  as  carriers  relax  within  subband  1. 
Comparing  Figures  5  and  7  it  can  be  seen  that, 
for  initial  photogeneration  in  subband  4,  some 
intersubband  (2—1)  optical  phonon  emission 
persists  for  t  >  10  ps  whereas,  for  photogeneration 
in  subband  2,  this  is  not  the  case.  This  observation 
explains  the  difference  in  r2  for  the  two  simula¬ 
tions  in  the  t  >  lOps  range:  for  initial  photo  ge¬ 
neration  in  subband  4,  the  average  carrier  energy  is 
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Time  (ps) 


FIGURE  8  Average  hole  energy  vs.  time,  for  an  initial 
population  photogenerated  in  subband  2. 

higher,  and  hence  some  carriers  are  still  able  to 
emit  optical  phonons  even  after  more  than  10  ps  of 
cooling  time;  thus  r2  is  enhanced  by  optical 
phonon  scattering.  On  the  other  hand,  for  photo¬ 
generation  in  subband  2,  the  average  carrier 
energy  is  lower,  and  r2(t  >  10  ps)  is  entirely 
acoustic-phonon  limited.  Figure  8  shows  the  time 
dependence  of  the  overall  average  carrier  energy 
for  this  simulation.  A  small  increase  in  average 
energy  is  indeed  observed  throughout  the  range 
t  <  5ps,  with  a  clear  decrease  thereafter.  Again, 
this  implies  that  the  second  subband  acts  as  a 
bottleneck  to  the  hole  cooling  process.  Clearly, 
this  conclusion  may  be  affected  significantly  by 
carrier -carrier  scattering,  dependent  on  the  extent 
to  which  intersubband  inter -carrier  processes  will 
redistribute  both  energy  and  population.  Calcula¬ 
tions  for  electrons  in  quantum  wells  show  that 
such  processes  occur  over  a  range  of  timescales  of 
the  order  10°  ps  [6]. 


4.  CONCLUSIONS 

An  ensemble  Monte  Carlo  code  has  been  developed 
which  includes  a  realistic  description  of  the  band- 
structure,  density  of  states,  and  phonon  scattering 
rates  for  holes  in  a  GaAs/AlAs  quantum  well. 


Simulations  of  intersubband  hole  relaxation  indi¬ 
cate  that  the  2nd  valence  subband  of  the  quantum 
well  acts  as  a  bottleneck  for  hole  cooling,  due  to 
both  the  small  2-1  subband  energy  separation  and 
the  large  density  of  states  in  the  off-zone  centre 
minimum.  Consequently,  a  2nd  subband  lifetime  as 
long  as  160ps  is  predicted  at  77  K,  compared  to  a 
lifetime  of  90  fs  for  the  4th  subband.  The  large 
difference  in  the  re-population  and  de-population 
rates  of  the  2nd  subband  can  result  in  the  accumu¬ 
lation  of  large  excess  carrier  populations;  which 
implies  that  Far-Infra-Red  photon  emission  may  be 
attainable  between  the  1st  and  2nd  subbands  under 
suitable  excitation  conditions. 
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We  study  the  application  of  the  mixed  finite  elements  method  (MFE)  for  the  bi- 
dimensional  simulation  of  the  simplified  hydrodynamic  and  energy-transport  models. 

The  two  main  points  are  the  use  of  entropic  variables  which  gives  a  symmetric  positive 
definite  problem  and  a  coupled  computation  of  the  equations  for  electrons  which 
requires  a  generalization  of  the  MFE  method  for  vector  valued  problems.  We  give 
numerical  results  on  JFET  and  HEMT  devices. 

Keywords:  Semiconductor  device  simulation,  hydrodynamic  and  energy-transport  models, 
entropic  variables,  mixed  finite  elements,  GMRES  algorithms,  HEMT  devices 


1.  INTRODUCTION  term  from  the  HD  model,  we  obtain  the  Simplified 

Hydrodynamic  (SHD)  model.  Indeed,  this  con- 
The  Drift-Diffusion  (DD)  model,  which  is  cur-  vective  term  can  be  neglected  for  some  applica- 
rently  the  most  usual  model  in  industrial  Simula-  tions,  otherwise  numerical  difficulties  may  occurs 
tion  of  electronic  devices,  is  not  sufficiently  as  in  fluid  dynamics  occurs, 
accurate  for  sub  micrometer  device  modeling.  The  Mixed  Finite  Elements  (MFE)  method, 
The  two  main  contenders  are  the  Hydrodynamic  which  was  first  developed  in  the  study  of  structural 

(HD)  [2,  6,  10]  and  the  Energy  Transport  (ET)  [11,  mechanics,  is  well  adapted  to  these  equations 

1]  models  which  provides  a  better  description  of  because  it  gives  a  good  conservation  of  the 

such  effects  as  velocity  overshoot  and  carrier  currents.  For  the  DD  equations  this  method  was 

heating.  Both  models  can  be  derived  from  the  used  by  Brezzi  et  al  [3,  8]  with  the  carrier 

Boltzmann  Transport  Equation  (BTE)  by  the  concentration  variables  and  by  Marrocco  et  al. 

moments  method.  If  we  remove  the  convective  [5,  7]  with  the  quasi-Fermi  level  variables.  This  last 
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approach  seems  well  adapted  when  considering 
heterojunctions.  Indeed,  the  quasi-Fermi  levels  are 
continuous  at  the  heterojunctions  when  densities 
are  not. 

We  consider  the  bi-dimensional  stationary  case 
and  take  a  simplified  hydrodynamic  model  or  an 
energy  transport  model  for  the  electrons  coupled 
with  a  drift-diffusion  model  for  the  holes  through 
the  Poisson  equation.  We  use  a  Boltzmann 
statistic.  We  express  the  problem  in  a  conservative 
form.  In  order  to  obtain  a  symmetric  positive 
definite  system  of  elliptic  equations,  we  introduce 
appropriate  variables.  We  first  use  the  electrostatic 
potential  <f>,  the  hole  quasi-Fermi  level  <pp  and  for 
the  electrons  we  introduce  entropic  variables  vi 
and  v2  (functions  of  the  electron  quasi-Fermi  levels 
<pn  and  the  electron  temperature  Tm  see  (9))  which 
lead  to  a  symmetric  formulation.  We  consider  the 
usual  “flow”  variables  like  the  electric  displace¬ 
ment  D,  the  hole  current  density  Jp,  the  electron 
current  density  Jn  and  a  “flow”  function  Jw  which 
depends  on  Jn  and  on  the  thermal  energy  flow 
density  Sn  (see  (10)).  The  boundary  conditions  are 
non-homogeneous  Dirichlet  conditions  on  the 
contacts  and  homogeneous  Neumann  conditions 
on  the  insulated  boundaries. 

We  extend  the  numerical  methods  developed  for 
the  DD  model  in  [5,  7]  and  for  the  uni-dimensional 
ET  model  in  [9].  The  solution  of  the  problem  is 
obtained  as  the  limit,  when  t  goes  to  infinity,  of  an 
artificial  transient  problem  discretized  with  local 
time  steps.  We  use  block  relaxation  techniques  as 
we  consider  successively  the  Poisson  equation,  the 
equation  for  the  holes  and  the  two  equations  for 
the  electrons.  The  two  first  equations  are  solved 
like  in  [5,  7].  On  the  other  hand,  in  order  to  treat 
the  electronic  conservation  laws,  we  extend  the 
MFE  method  for  a  vector  valued  problem.  For 
each  case  “implicit  type”  discretization  for  time 
and  mixed  finite  elements  for  space  are  used.  The 
non-linear  problems  are  solved  by  Newton-Raph- 
son  algorithms. 

An  outline  of  the  paper  is  as  follows.  In  section  2 
the  physical  model  is  presented  in  his  symmetric 
form.  We  present  the  general  method  of  solution 


in  section  3.  The  extension  of  the  MFE  method  for 
vector  valued  problems  is  developed  in  section  4. 
And  finally  we  present  numerical  results  on  JFET 
and  HEMT  in  section  5. 


2.  PHYSICAL  MODELS 


We  consider  the  following  problem  on  c  U2 

-di v  D  +  q(N  -  P  -  dop)  =0,  (1) 

—divJp  —  qU  =  0,  (2) 

— div/„  +  qU=  0,  (3) 

— di vJ*  +  (w„  -  q{4>  -x))U-  N— — —  =  0, 

T\V 

(4) 

where  Z),  Jp ,  Jn  and  Jw  are  defined  by 

D  =  eV<j> ,  (5) 

Jp  =  qfipP'S/ifp,  (6) 

<7) 

=  <8> 

and  the  matrix  A  is  defined  by 


with  dop  the  doping  profile,  U  the  net  recombina- 
sion  rate  per  unit  volume,  w„  the  electron  mean 
energy  given  by  (3/2)  kBTn,  w0  the  electron  mean 
energy  at  equilibrium,  rw  the  energy  relaxation 
time  (constant),  TL  the  lattice  temperature,  e  the 
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permittivity,  fip  the  hole  mobility  and  f. tn  the 
electron  mobility  given  by  p°n(TL/Tn)a. 

Let  us  remark  that  the  matrix  A  is  symmetric 
positive  definite.  This  proves  the  interest  of  the 
variables 


Vi  =  and  v2  =  -=! .  (9) 

in  in 

The  link  between  Jw  and  Sn  is  the  following: 

Jw  =  -Sn-(<l>-x)Jn-  (10) 

The  boundary  conditions  are  non-homogeneous 
Dirichlet  conditions  on  the  ohmic  contacts  and 
homogeneous  Neumann  conditions  on  the  insu¬ 
lated  boundaries: 


<t>=g<f> »  P  =  gp,  N  =  gN ,  Tn=gTn  on  T/), 


d(j)  dP  dN_dTn 
dn  dn  dn  dn 


on  Tjy  =  £1  —  T d- 


The  difference  between  the  models  comes  from 
the  definition  of  at,  (3  and  c  :  a  =  1,  (3  =  5/2  and  0 
<  c  <5/2  for  the  SHD  model,  a  =  1,  (3  —  3/2  and 
c  =  0  for  the  Stratton  ET  model  and  a  =1/2 ,  (3 
=  2  and  c  =  0  for  the  Degond  et  al  ET  model. 


3.  NUMERICAL  METHODS 


The  solution  of  the  problem  (1)  — (8)  is  obtained  as 
the  limit,  when  t  goes  to  infinity,  of  the  following 
artificial  transient  problem 


s^dt(j)  -  divi)  +  q(N  —  P  —  dop)  =  0, 


sF*  dtpp  -  di vJp  -  qU  =  0, 


(k 


qU 


- q(4> -  x))u - n'- 


r_ 


=  0, 


(11) 

(12) 

(13) 


Our  choice  of  functions  and  SUj  is 

motivated  by  the  following  considerations:  we 
must  keep  the  homogeneity  of  the  equations,  S 
must  be  a  positive  definite  matrix  and  functions 
5^,  sP*  must  be  strictly  positive  and  we  want  to 
optimize  the  convergence.  In  our  computations, 
we  take  as  in  [5,  7] 


,r  =■ 


(Ax) 


2°’ 


&=■ 


(Ax)' 


and  by  natural  extension 


S  = 


1 

(A xf 


A. 


We  use  block  relaxation  techniques  as  we 
consider  successively  the  Poisson  Eq.  (11),  the 
equation  for  the  holes  (12)  and  the  system  of 
equations  for  the  electrons  (13).  So  we  have  to 
solve  scalar  and  vector  valued  non  linear  parabolic 
equations.  We  use  an  implicit  type  discretization 
for  time  and  mixed  finite  elements  for  space.  The 
non-linear  problems  are  solved  by  Newton-Raph- 
son  algorithms. 

The  two  first  problems  (11,  12)  are  solved  like  in 
[5,  7].  On  the  other  hand,  for  the  computation  of 
the  system  (13),  we  extend  the  MFE  method  for  a 
vector  valued  problem.  We  develop  this  in  the  next 
section. 

For  the  linear  systems,  direct  solvers  (Cholesky 
or  LU  factorization)  are  used.  The  non-symmetric 
linear  system  associated  with  the  coupled  Eq.  (13) 
can  efficiently  (memory/time)  be  solved  via 
GMRES  algorithms  with  block-diagonal  pre-con¬ 
ditioning. 


4.  SOLUTION  OF  THE  TWO  ELECTRON 
EQUATIONS:  EXTENSION  OF  THE  MFE 
METHOD  FOR  A  VECTOR  VALUED 
PROBLEM 


where  D,  Jp ,  Jm  Jw  are  respectively  given  by  Eqs.  At  each  time  step  k ,  after  solving  the  Poisson 
(5-8).  equation  and  the  hole  conservation  equation,  we 
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solve  the  following  problem:  find  v*+1,  v*"1"1,  explicit  formula  for  v^1,  v^1  (on  each  element  e  of 
J„+x,  solution  of  the  following  system:  the  mesh): 


S(x)  (  Vi  fc+1  -  Vl  *  \ 


At  V  Vo  fc+1  -  Vo  * 


+  G(x,  Vl  fc+1 ,  V2*+1)  =0, 


),  (15) 


with  boundary  conditions 


=  -Ae 


vL~vi 


+  GUVU>VL)-<liv(, 


J  /+! 


where  A s  depends  on  v[,  vl2. 


and 


C‘:S) 

ZTn' 

d_  /vi  *+1\ 
Vv2t+1  / 


on  r© 
=  0  on  IV 


5.  NUMERICAL  RESULTS 

We  have  validated  our  method  either  with  uni¬ 
dimensional  devices  such  that  balistic  diodes  (with 
or  without  heterojunctions)  or  p-n  diodes  and  bi- 


Let  us  introduce  the  following  Sobolev  spaces 

H (div)  =  {w|u>  €  (L2(n))p,  div(w)  €  L2(fi)}, 
Vo  =  €  H (div),  uj.n  =  0  on  IV}. 

The  variational  mixed  dual  formulation  is  the 
following  (with  B=A~X)  :  find  (v^vf"1)  G 
(L2(fi))2  and  (/*+1,/*+1)  e  (V0)2  such  that 


Vue  (l2([0,l]))2, 


with  u>i  G  F0. 


The  discrete  form  of  the  problem  can  be  directly 
derived  from  (16,  17)  by  using  Raviart-Thomas 
spaces  of  lowest  degree.  Thus,  we  obtain  a  non¬ 
linear  problem.  We  apply  a  Newton-Raphson 
iterative  technique  wich  gives  at  each  step  /:  a 
linear  system  for  the  only  variables  Jl+X ,  /(+1  and  a 


Polarization  Drain-Source  In  Volts 


FIGURE  1  Current- Voltage  characteristic. 


FIGURE  2  Log  of  the  electron  concentration. 
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dimensional  devices  such  that  JFET  or  HEMT 
with  0.2  pm  gates  length. 

We  give  here  the  charateristic  Is  ( V-DS. )  curves 
and  the  electron  concentration  and  temperature 
distribution  for  the  HEMT  device  for  applied 
potential  Vs  =  OV,  Vd  =  +  IV,  VG  =  -2V.  We  use 
a  mesh  with  20  000  elements  which  gives  30  000 
degrees  of  freedom  for  each  flow  variable. 
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Numerical  simulation  results  of  noise  due  to  current  fluctuations  along  an  n+  -n-n+ 
submicron  structure  are  presented.  The  mathematical  framework  is  based  on  the 
interpretation  of  the  equations  describing  electron  transport  in  the  semiclassical 
transport  model  as  stochastic  differential  equations  (SDE).  According  to  this  formalism 
the  key  computations  for  the  spectral  density  describing  the  noise  process  are  reduced  to 
a  special  initial  value  problem  for  the  Boltzmann  transport  equation  (BTE).  The 
algorithm  employed  in  the  computation  of  the  space  dependent  noise  autocovariance 
function  involves  two  main  processes:  the  stationary  self-consistent  solution  of  the 
Boltzmann  and  Poisson  equations,  and  a  transient  solution  of  the  BTE  with  special 
initial  conditions.  The  solution  method  for  the  BTE  is  based  on  the  Legendre 
polynomial  method.  Noise  due  to  acoustic  and  optical  scattering  and  the  effects  of  non- 
parabolicity  are  considered  in  the  physical  model. 


Keywords:  Stochastic  differential  equations,  Boltzmann  transport  equation,  noise,  Legendre 
polynomials 


1.  INTRODUCTION 

Current  noise  in  semiconductors  is  due  to  the 
inherent  randomness  of  the  scattering  mechanisms 
that  govern  electronic  transport.  These  current 
fluctuations  around  a  stationary  value  are  gener¬ 
ally  characterized  by  the  associated  autocovar¬ 
iance  function  or  equivalently,  by  their  spectral 
density.  Employing  the  machinery  of  SDE,  a  new 


noise  model  [1]  shows  that  the  key  computations 
for  the  noise  autocovariance  function  are  reduced 
to  the  transient  solution  of  the  BTE  with  special 
initial  conditions.  This  novel  approach  was  pre¬ 
viously  utilized  [2]  for  the  computation  of  the  noise 
spectral  density  in  bulk  silicon.  The  BTE  was 
deterministically  solved  using  the  Legendre  poly¬ 
nomials  method  and  the  results  were  in  excellent 
agreement  with  those  obtained  using  the  Monte 
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Carlo  technique.  In  this  paper  we  study  the  impact 
that  the  spatial  variation  of  the  doping  has  on  the 
current  noise  autocovariance  function.  The  subject 
of  our  study  is  a  one-dimensional  n+ -n~n  + 
submicron  structure  in  the  stationary  regime.  The 
outline  of  our  paper  is  as  follows.  Sections  II  and 
III  concentrate  on  the  noise  and  physical  models 
respectively.  Section  VI  describes  the  algorithm 
and  Section  V  presents  the  numerical  results.  The 
last  section  is  devoted  to  the  conclusions. 


2.  THE  NOISE  MODEL 


Xk^i) 


W(kj,  kj  +  Uj) 
A  (ki) 


(3) 


where  kt  =  k(tj)  and  kt  +  w,  =  k(tf). 

These  same  equations  describe  the  electron 
motion  in  Monte  Carlo  simulations. 

In  the  context  of  the  SDE  theory,  these 
equations  correspond  to  a  Markov  process,  which 
is  characterized  by  a  transition  probability  func¬ 
tion  satisfying  the  Kolmogorov-Feller  equation.  In 
the  case  of  semiclassical  transport  the  latter 
equation  is  identical  to  the  linear  (non-degenerate) 
BTE  (eq.  4) 


According  to  semiclassical  transport  theory,  the 
motion  of  an  electron  in  a  semiconductor  is 
described  by  the  following  stochastic  differential 
equations: 

dx  ->  1  ->  7p  ^dk  ->  - 

-=v(*)=jV,-e(*),  d-^h—  =  -qE+F„ 

and  Fr  =  y^j  MtS(t  -  tj),  (1) 


r\ 

(x k  , 0; x, k, t)  +  v(k)  ■  V*p- 
|  E{x,t).V£P  = 

J  p(x', k', 0;  x, ic", r)  W(k", k)dk"  -  \(k)p,  (4) 

subject  to  the  following  initial  condition: 


where  x,  v,  p  and  k  are  the  electron  position, 
velocity,  momentum  and  wave  vector,  respectively, 
E  is  the  electric  field,  e{k)  is  the  energy-wave  vector 
relationship  in  the  given  energy  band  and  Fr  is  the 
random  impulse  force  on  the  electron  due  to 
scattering.  The  random  force  is  characterized  by 
the  transition  rate  W(k,k').  Accordingly,  the 
probability  of  scattering  is  given  by 


Pr{//  -  tt- 1  >  t}  =exp  j  -  jT 


A (k)=  J  W{k,k')dk' 


(2) 


where  A(&)  is  the  scattering  rate.  Therefore,  given 
the  electron  wave  vector  k,  X(k)At  is  the  prob¬ 
ability  that  a  jump  in  momentum  will  occur  in  a 
small  time  interval  At.  Assuming  that  a  scattering 
event  has  occurred  at  some  time  tif  the  probability 
density  function  for  the  amplitude  of  the  jump  is 
given  by, 


p(x,  k’,  0;  x,  ic,  r)|T=0  =  6{k-  k')  ■  6{x  -  x),  (5) 

where  £(•)  is  the  Dirac  delta  function  and  the 
following  notation  has  been  adopted: 

p(x,  ic,  t\x',  k',  0)  =  p{x,  ic',  0;  x,  k,  t)  (6) 

Generally,  noise  in  semiconductors  is  character¬ 
ized  by  the  spectral  density  of  current  fluctuations. 
The  spectral  density  is  defined  as  the  Fourier 
transform  of  the  autocovariance  function.  The 
autocovariance  of  any  random  process  can  be 
found  from  the  transition  probability  density 
function  of  such  random  process.  Since  the 
transition  probability  function  satisfies  the  Kol¬ 
mogorov-Feller  equation,  and  the  BTE  is  identical 
to  it,  the  transition  probability  density  function 
can  be  obtained  from  the  solution  of  the  BTE 
subject  to  appropriate  initial  conditions. 

In  [1]  it  was  shown  that  the  current  longitudinal 
noise  autocovariance  function  can  be  obtained 
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from  the  transient  solution  of  the  BTE  subject  to 
the  following  “special”  initial  condition: 

p{x,  ic,  r)|T=0  =  {v(k)  -  (v )*)/(*,  k).  (7) 

where 

(v>*  =  J  v(k)f(x,k)dk.  (8) 

Here,  in  (7)  f{x,k)  represents  the  steady  state 
solution  of  the  BTE.  The  current  autocovariance 
function  is  computed 

Ky(x)  t)  =  q2  J  v(k)p(x ,  k,  r)dk \  r  >  0.  (9) 

It  is  very  important  to  note  that  this  approach  for 
the  current  noise  autocorrelation  computation  is 
strictly  within  the  framework  of  semiclassical 
transport.  This  approach  directly  connects  the 
physics  of  scattering  with  the  current  noise 
characteristics  and  makes  no  additional  assump¬ 
tions  regarding  the  nature  of  the  noise. 

3.  THE  DEVICE  MODEL 

A  device  under  stationary  conditions  is  appro¬ 
priately  characterized  by  the  Poisson  Eq.  (10)  and 
the  space-dependent  BTE  (11).  The  self-consistent 
solution  of  these  equations  provides  f(x,k),  the 
stationary  probability  density  function  and  0(x), 
the  electrostatic  potential  throughout  the  device. 
In  mathematical  form  these  equations  are  given 
by: 

Vl4>(x)=l[n(x)-N(x)}.  (10) 

where  N(5c)  is  the  doping  concentration  and 
n(x)  =  jf{x,k  )dk  is  the  free  charge  concentration 
and 

v(k)  ■  Wxf(x,k)  -  |iT(x)  •  ^pf— 

J  f(x,  k")  W{k",  k)dk"  -  A  (k)f  (11) 


Since  in  our  example  we  are  using  a  1-D«+-k- 
n+  structure,  E  and  x  are  parallel  and  can  be 
replaced  by  the  scalars  E  and  x.  It  is  also  assumed 
that  E  is  in  the  symmetric  [1  1  1]  crystallographic 
direction  and  therefore  a  single  band  distribution 
function  accurately  represents  the  state  of  the 
momentum  space. 

4.  THE  ALGORITHM 

The  space-dependent  noise  auto-correlation  func¬ 
tion  is  computed  according  to  the  following 
straight-forward  procedure: 

1.  Obtain  f(x,k)  and  4>(x)  from  the  self  consistent 
solution  of  the  Poisson  Eq.  (10)  and  space- 
dependent  BTE  (11).  The  self-consistent  solu¬ 
tion  to  the  Poisson  and  Boltzmann  equations  is 
achieved  by  concurrently  solving  the  corre¬ 
sponding  discretized  equations  in  a  Gummel- 
type  iteration.  This  process  comprises  the 
following  steps: 

(a)  Generate  initial  values  for  <j)(x)  and  n(x) 
using  the  Drift-Diffusion  model. 

(b)  Generate  initial  values  for  / (x,  k )  consistent 
with  (p(x )  and  n(x)  calculated  in  step  (a), 
solving  the  homogeneous  BTE. 

(c)  Obtain  fi+1(x,k)  from  fl(x}  k)  solving  the 
space-dependent  BTE  until  the  current 
along  the  device  converges.  The  BTE  is 
solved  using  the  electric  field  that  corre¬ 
sponds  to  cj)l(x ). 

(d)  Solve  Poisson  Eq.  (10)  for  <fil+l(x)  given 
nl+l(x). 

(e)  Steps  (c)  and  (d)  are  repeated  until  the  whole 
system  converges. 

2.  Compute  special  initial  condition  g(x,k,r) |r=0 
as  prescribed  by  (7). 

3.  Obtain  g(x,A:,  r)  solving  the  transient  BTE  (4) 
with  the  special  initial  condition  computed  in 
step  2,  and  the  electric  field  E(x)  corresponding 
to  <j>(x)  obtained  in  step  1.  The  transient 
solution  to  the  BTE  is  based  on  implicitly 
approximating  the  time  derivative  in  Eq.  (4). 
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Therefore  g(x,k,rk+x)  can  be  computed  from 
the  solution  of  Eq.  (4)  and  the  knowledge  of 
g(x,  k ,  rk)9  for  any  k.  This  process  is  comprised 
of  the  following  steps: 

(a)  Generate  initial  guess  values  for  g(x9 k,rM) 
=  g(x,k,Tk) 

(b)  Obtain  gJ+I(x,k, rk+l)  from  gl(x,ic9rk+l) 
solving  the  transient  BTE,  Eq.  (4),  until 
total  current  converges. 

(c)  Advance  the  time  index  and  go  to  step  (a). 

4.  Compute  the  autocovariance  function  accord¬ 
ing  to  Eq.  (9). 

5.  SOLUTION  OF  THE  BTE  EMPLOYING 
LEGENDRE  POLYNOMIALS 

In  order  to  map  ellipsoidal  energy  surfaces  into 
spherical  ones,  the  Herring-Vogt  transformation  is 
employed.  In  this  transformation  the  original 
coordinates  x,  E  and  k  are  mapped  into  x*,E 
and  k  ,  respectively  and  the  dispersion  relation¬ 
ship  becomes  spherical  7  (e)  =  e  +  j3  e2  =  h2  k *2/ 
2 m0  in  the  new  domain.  The  BTE  remains 
unchanged  in  the  stared  variables  and  the  direction 
of  E  defines  a  symmetry  axis  (2).  The  density 
function  dependence  on  momentum  can  be  ex¬ 
pressed  in  terms  of  only  two  independent  vari¬ 
ables:  e  and  6  .  We  expand /(x*, k*9  t )  in  Legendre 
polynomials  according  to: 

/(x* ,k*,t)=fo (x* , e,  t)  +  k*g(x* ,  e,  t)  cos  0 

+ifc*2A(x*,c,  0(3  cos  20-  1).  (12) 

— 

This  representation  for  f(x*9k  ,  t),  is  replaced  in 
Eq.  (4).  Recalling  that  the  Legendre  polynomials 
are  orthogonal,  the  resulting  equation  is  solved  for 
the  corresponding  coefficients  independently.  This 
results  in  a  system  of  three  coupled  differential- 
difference  equations  for  the  functions  /0,  g  and  h , 
in  terms  of  t ,  x*  and  e .  The  system  is  solved  for 
/o(x*,  e9  t)  in  the  energy  domain  using  standard 
finite  difference  method.  The  functions  g(x*,  e,  t) 
and  h(x*,  e9  t)  are  then  obtained  from  /o(x*,  e,  t). 


6.  NUMERICAL  RESULTS 

Figure  1  shows  the  parameters  employed  in  the 
simulation:  A  Gaussian  doping  profile  ranging 
from  1016  to  1017  cm-3  with  the  highest  doping  at 
the  device  boundaries  and  minimum  in  the  middle, 
2  Volts  of  applied  voltage,  a  device  length  of  0.5 
pm,  etc.  Figure  2  shows  the  space-dependent 
current  noise  autocovariance  function  computed 
for  this  device.  Several  comments  are  in  order:  a) 
The  maximum  value  of  cr2,  the  noise  power,  occurs 
at  the  0.15  pm  point  in  which  the  electrostatic  field 
reaches  its  maximum  value.  This  result  had 
already  been  observed  for  Bulk  silicon  in  [2],  b) 
The  auto-covariance  function  remains  positive  for 
points  in  the  middle  of  the  device  for  which  the 


0.5  (xm 

^ ^ 


FIGURE  1  shows  the  device  considered  in  our  simulations. 
The  device  is  0.5  gm  long,  the  applied  potential  is  equal  to  2 
volts,  the  doping  profile  is  Gaussian  and  symmetrical  with  its 
lowest  value  in  the  middle  of  the  device. 


noise  auto— covariance  function 


FIGURE  2  shows  the  current  longitudinal  noise  autocorrela¬ 
tion  function  corresponding  to  our  simulations.  The  value  of  a2 
reaches  it  maximun  value  at  0.15  pm,  point  for  which  the 
electrostatic  field  has  reached  its  maximum  value. 
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spectral  density 


FIGURE  3  shows  the  spectral  density  function  corresponding 
to  Figure  2.  This  figure  shows  that  the  noise  at  0.15  pm,  point 
for  which  the  electrostatic  field  has  reached  its  maximum  value, 
has  important  high  frequency  components. 

doping  reaches  its  minimum  and  c)  the  auto¬ 
covariance  functions  at  both  ends  are  identical,  a 
result  that  is  in  complete  agreement  with  the 
assumed  space-independence  condition  for  the 
boundaries  of  the  device.  Figure  3  shows  the 
corresponding  spectral  density  function  to  Fig¬ 
ure  2.  The  most  important  feature  of  this  figure  is 
that  higher  frequency  components  are  observed 
again  for  the  points  in  the  area  of  large  electrotatic 
field. 

7.  CONCLUSIONS 

Numerical  results  for  the  space-dependent  auto¬ 
covariance  and  spectral  density  function  of  noise 
due  to  acoustic  and  optical  phonon  scattering  were 
presented.  A  simple  algorithm  to  compute  the 
noise  auto-correlation  function  that  accounts  for 
the  effect  of  the  spatial  variation  of  the  doping  was 
described.  The  algorithm  was  succesfully  em¬ 
ployed  for  noise  calculations  for  a  sub-micron 
stucture  in  stationary  regime.  Hence  the  SDE  noise 
model  is  demonstrated  to  be  a  viable  approach  for 
space-dependent  noise  computations  . 
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Good  agreement  between  a  hydrodynamic  and  a  Monte-Carlo  device  model  is 
demonstrated  in  this  paper  for  an  advanced  SiGe  Heterojunction  Bipolar  Transistor. 
This  result  is  based  on  two  principles:  1)  Extraction  (from  the  Monte-Carlo  bulk  model 
under  homogeneous  conditions)  of  the  relaxation  times  r  at  discrete  points  of  the 
parameter  space  spanned  by  the  Ge-content  x,  doping  density  N,  carrier  temperature  Tc 
and  lattice  temperature  TL.  2)  Modeling  of  the  relaxation  times  r(x ,  N ,  Tc ,  Tl)  by 
splines. 


Keywords:  Silicon-Germanium  (SiGe),  Hetereo  Bipolar  Transistor  (HBT),  hydrodynamics, 
transport  parameter,  spline  model,  GHDM,  Monte-Carlo 


1.  INTRODUCTION 

Among  all  SiGe  device  concepts,  SiGe  Hetero¬ 
junction  Bipolar  Transistors  (HBTs)  currently 
have  the  highest  potential  for  commercial  applica¬ 
tions.  In  order  to  support  the  design  of  SiGe 
HBTs,  accurate  and  efficient  device  simulation 
tools  are  necessary.  However,  though  even  some  of 
the  commercially  available  simulators  offer  the 
capability  of  simulating  heterojunction  devices, 
reliable  transport  parameters  for  these  devices  are 
not  available  for  most  design  tasks.  The  under¬ 


lying  reason  for  this  dilemma  is  that  the  Ge 
content  x  is  variable  in  SiGe  devices,  which  has 
added  an  additional  dimension  to  the  problem  of 
determining  transport  parameters. 

For  example,  the  relaxation  times  of  a  hydro- 
dynamic  (HD)  model  for  SiGe  HBTs  r(N,  x,  Tc, 
Tl)  depend  on  four  independent  quantities  instead 
of  three,  namely  the  total  doping  density  N ,  the  Ge 
content  x,  the  carrier  temperature  Tc  and  the  lattice 
temperature  TL.  Therefore  the  traditional  ap¬ 
proach  of  extracting  transport  parameters  predo¬ 
minantly  from  experimental  data,  which  worked 
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well  for  silicon  for  a  long  time,  is  no  longer  feasible 
for  SiGe  devices  because  reliable  experimental  data, 
especially  for  strained  SiGe,  are  hardly  available. 


2.  TRANSPORT  PARAMETERS 

In  order  to  overcome  this  problem  the  compre¬ 
hensive  and  experimentally  verified  Monte-Carlo 
bulk  transport  model  described  in  [1]  was  applied 
to  generate  transport  parameters  for  the  drift- 
diffussion  (DD)  or  HD  device  simulations  at 
discrete  mesh  points  of  the  4D  space  spanned  by 
N,  x ,  Tc ,  Tl. 

To  generate  the  smooth  functions  r(N,  X,  Tc, 
Tl ),  that  are  needed  for  example  for  HD  device 
simulations  from  the  resulting  table  of  transport 
data,  a  flexible  monotonicity  preserving  spline 
interpolation  scheme  has  been  developed. 


3.  THE  SPLINE  MODEL 

Compared  to  the  traditional  approach  of  using 
closed  form  analytic  expressions  with  only  few 
model  parameters  for  r(7V,  x,  Tc ,  T£)  our  spline 
approximation  scheme  has  several  advantages: 
For  example,  it  adapts  itself  automatically  to 
model  extensions  like  an  extended  range  of  N ,  x, 
Tc  or  Tl.  Moreover,  it  is  easy  to  control  the 
accuracy  of  the  spline  interpolation  by  just 
generating  a  denser  table  of  transport  data  for 
the  range  of  N ,  x,  TL,  Tc,  that  is  of  highest 
interest.  In  addition,  our  method  is  also  capable  of 
working  on  a  non-rectangular  grid  in  the  N-x-TL- 
Tc  space. 

The  underlying  algorithm  considers  the  func¬ 
tional  dependencies  of  the  transport  parameters  in 
two  stages.  First,  the  dependence  on  variables  not 
being  influenced  by  device  simulation  like  N,  xGe 
and  sometimes  TL  are  considered  by  multilinear 
interpolation.  In  order  to  process  data  on  a  non- 
rectangular  grid  the  algorithm  performs  a  multi¬ 
dimensional  search  of  the  nearest  neighbour  data 
points  suitable  for  the  interpolation. 


The  second  stage  deals  with  dependencies  on 
variables  of  the  hydrodynamic  equations  itself,  like 
the  carrier  temperature  Tc .  This  is  considered  by 
using  a  variant  of  cubic  splines  called  Akima 
Subsplines  [2].  The  spline  curves  generated  by  this 
method  do  not  contain  artificial  oscillations  which 
can  cause  artificial  modeling  results  or  conver¬ 
gence  problems  for  the  solution  algorithm. 

The  final  result  that  influences  the  memory 
requirements  of  the  device  simulator  is  a  set  of 
spline  coefficients  for  each  grid  point  of  the  device. 
For  medium  grid  sizes  (3000  points)  roughly  20 
MBytes  of  memory  are  necessary  to  hold  the  spline 
coefficients. 


4.  SIMULATION  RESULTS 

In  order  to  verify  the  validity  of  our  modeling 
approach  we  have  simulated  the  two-dimensional 
SiGe  HBT  structure  shown  in  Figure  1.  The 
structure  is  very  demanding  for  numerical  device 
models  because  of  its  narrow  base  and  piecewise 
constant  profiles  for  N  and  x  that  give  rise  to 
abrupt  junctions  and  steps  in  the  valence  and 
conduction  band  edges.  Consequently  this  device 
is  well  suited  for  testing  the  modeling  accuracy  of 


oH 


ioo- 

150- 

170" 


370- 

470- 

x/nm 


0  50 

_i _ i _ 


250 


450 


y/nm 


B 

if  -emitter 

n-emitter 

ff  -SiGe  base 

n-collector 

rf  -collector 

19  -3 

Nn=1-10  cm 


ND=5-1018cm3 
N.=1  •1019crri3 

*G?0'2 
*  .  J7  -3 

Nd=2-10  cm 


Nn=1*1019cm 


FIGURE  1  The  2D  SiGe-HBT  test  structure. 
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classical  device  simulation  in  comparison  to 
Monte-Carlo  device  simulation.  The  structure 
has  been  simulated  by  three  different  models. 
The  first  model  is  the  DD  model.  The  second 
model  is  an  extension  of  the  Generalized  HD 
Model  reported  in  [3]  for  devices  with  position- 
dependent  band  structure.  Moreover,  the  heat  flux 
reduction  that  has  been  proven  to  be  beneficial  for 
ultra  short  MOSFETs  [4]  has  been  adopted.  The 
third  model  is  a  newly  developed  Monte-Carlo 
device  model  for  SiGe  heterojunction  devices. 
Details  of  this  MC  model  will  be  published 
elsewhere.  Since  all  transport  parameters  of  the 
DD  and  HD  simulators  have  been  derived  from 
the  SiGe  MC  bulk  model  and  because  the  MC 
device  model  used  exactly  the  same  band  structure 
and  scattering  models  as  the  MC  bulk  simulator, 
all  three  models  are  fully  consistent  under  homo¬ 
geneous  material  and  field  conditions.  Moreover, 
all  three  device  models  use  exactly  the  same  offsets 
for  the  valence  and  conduction  band  edges. 

The  results  of  the  2D  simulations  for  V^E  =  0 .75 
V,  KCe  —IV  and  300  K  are  summarized  in 
Figures  2-5  for  the  electric  potential,  the  dynamic 
temperature,  the  drift  velocity  and  the  electron 
density.  In  all  cases  the  results  of  the  three  models 
are  shown  along  the  vertical  line  at  y  =  450  nm.  It 
can  be  seen  that  at  the  base  collector  junction  even 


in  the  MC  model  the  drift  velocity  is  more  than  a 
factor  of  two  higher  than  the  maximum  drift 
velocity  under  homogeneous  field  conditions. 
Despite  this  overshoot,  which  would  be  extremely 
large  for  Si-based  devices,  it  can  be  clearly  stated 
that  the  hydrodynamic  results  are  in  good  agree¬ 
ment  with  the  MC-results.  Especially  the  electron 
density  profile  in  the  base  and  the  space  charge 
region,  which  is  important  for  the  transient 
behavior  of  the  HBT,  agrees  well  for  the  HD 
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FIGURE  2  Comparison  of  the  electrostatic  potential  profiles 
resulting  from  the  DD,  HD  and  MC  models.  All  models  are  in 
good  agreement. 


FIGURE  3  Comparison  of  the  dynamic  temperature  profiles  resulting  from  the  HD  and  MC  models.  Both  model  are  in  good 
agreement. 
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VCE=1V,  Vbe=0.75V 


FIGURE  4  Comparison  of  the  electron  densities  resulting  from  the  DD,  HD  and  MC  models.  The  HD  and  MC  density 
distributions  are  in  good  agreement.  The  DD  model  deviates  substantially. 


X  [|Am] 

FIGURE  5  Comparison  of  the  drift  velocity  profiles  resulting  from  the  DD,  HD  and  MC  models.  Please  note  the  velocity 
overshoot  which  is  extreme  for  Si-based  devices. 


and  the  MC  simulation.  On  the  other  hand,  the 
DD  simulation  deviates  much  more  from  the  MC- 
reference,  which  may  lead  to  intolerable  errors  of 
the  DD-model  for  agressively  scaled  SiGe  HBTs  in 
the  near  future  [5]. 

To  the  best  of  our  knowledge  the  results 
reported  in  this  paper  represent  the  first  2D 
simulations  of  a  SiGe  HBT  with  fully  consistent 
DD,  HD  and  MC  device  models. 
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A  new  numerical  method  for  semiconductor  device  simulation  is  presented.  The  additive 
decomposition  method  has  been  successfully  applied  to  Burgers’  and  Navier- Stokes 
equations  governing  turbulent  fluid  flow  by  decomposing  the  equations  into  large-scale 
and  small-scale  parts  without  averaging.  The  additive  decomposition  (AD)  technique  is 
well  suited  to  problems  with  a  large  range  of  time  and/or  space  scales,  for  example, 
thermal-electrical  simulation  of  power  semiconductor  devices  with  large  physical  size. 
Furthermore,  AD  adds  a  level  of  parallelization  for  improved  computational  efficiency. 
The  new  numerical  technique  has  been  tested  on  the  1-D  drift-diffusion  model  of  a  p-i-n 
diode  for  reverse  and  forward  biases.  Distributions  of  </>,  n  and  p  have  been  calculated 
using  the  AD  method  on  a  coarse  large-scale  grid  and  then  in  parallel  small-scale  grid 
sections.  The  AD  results  agreed  well  with  the  results  obtained  with  a  traditional  one- 
grid  approach,  while  potentially  reducing  memory  requirements  with  the  new  method. 

Keywords:  Numerical  methods,  semiconductor,  simulation,  drift-diffusion,  decomposition 


INTRODUCTION 

The  numerical  method,  additive  decomposition, 
has  been  successfully  applied  in  mechanical  and 
chemical  engineering  to  Burgers’  equation  and  the 
Navier-Stokes  equations  governing  turbulent  fluid 
flow  by  decomposing  governing  equations  into 


large-scale  and  small-scale  parts  without  aver¬ 
aging,  e.g.,  [1-3].  The  additive  decomposition 
(AD)  technique  is  well  suited  to  problems  with  a 
large  range  of  time  and/or  space  scales,  for 
example,  thermal-electrical  simulation  of  power 
semiconductor  devices  with  large  physical  size. 
Thermal-electrical  effects  have  a  large  range  of 
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time  scales  since  the  electrical  time  constants  are 
much  faster  than  the  thermal  time  constants. 
Furthermore,  additive  decomposition  adds  a  level 
of  parallelization  for  improved  computational 
efficiency.  Thus,  semiconductor  device  simulation 
is  a  natural  application  of  the  additive  decomposi¬ 
tion  numerical  technique.  Initially,  we  decompose 
the  simplest  device  equations,  the  drift-diffusion 
model,  to  test  the  method.  After  successful  im¬ 
plementation  for  the  drift-diffusion  equations,  we 
plan  to  apply  the  new  approach  to  the  hydro- 
dynamic  semiconductor  model. 


ADDITIVE  DECOMPOSITION 

OF  THE  DRIFT-DIFFUSION  EQUATIONS 

The  standard  drift-diffusion  model  of  semiconduc¬ 
tors  consists  of  the  following  equations.  Poisson’s 
equation  is 

V24>  =  ~^{p-n  +  Nd-Na)  (1) 

and  the  continuity  equations  for  electrons  and 
holes  are 

dti-  1 

=  -  V[£>„  •  Vn  -  fi„  -n  •  V<£]  -  R  (2) 
dp  1 

di  =  “~VHVV^  -  Vp-P-V4>]  -  R  (3) 

where  the  symbols  have  their  standard  mean¬ 
ing  [4]. 

Additive  decomposition  of  the  drift-diffusion 
semiconductor  model  proceeds  as  follows.  First, 
the  basic  variables,  electrostatic  potential,  electron 
density  and  hole  density,  are  divided  into  large- 
scale  and  small-scale  components: 

<£  =  <£  +  <£*,  n  =  fi  +  n*,  p=p+p*.  (4) 

here  4>  is  the  large-scale  component  and  <£*  is  the 
small-scale  component  of  the  potential,  and 
similarly  for  n  and  p.  After  substituting  Eq.  (4) 
into  Eq.  (1)  through  (3),  the  equations  are  then 


decomposed  additively  into  large-scale  and  small- 
scale  equations: 


V20  =  —  (»  -  P  -  Nda )  (5a) 

N*J  (5b) 


where  Nda  =  Nd—Na,  and 


dn 

lft  =  w 


D„Vn  -  fi„nV<t>  -  /x„  •  ( 1-/3) 


(nV<f>*  +  n*V<f>) 


-  R 


dn* 

dt 


=  V 


DnVn*  ~  -  pn-P 


(wV^*  +  n*V0) 


R* 


(6a) 


(6b) 


Equation  (5a)  is  the  large-scale  Poisson’s  equa¬ 
tion,  (5b)  is  the  small-scale  Poisson’s  equation, 
(6a)  is  the  large-scale  current  continuity  equation 
for  electrons,  and  (6b)  is  the  small-scale  current 
continuity  equation  for  electrons.  The  decomposi¬ 
tion  of  the  current  continuity  equation  for  holes  is 
similar  to  (6).  The  additive  decomposition  is  done 
in  such  a  manner,  that  Eqs.  (5a)  and  (5b)  added 
together  are  equivalent  to  Eq.  (1),  and  in  the  same 
way,  Eqs.  (6a)  and  (6b)  added  together  result  in 
the  original  Eq.  (2). 

The  decomposition  parameter  (3  is  a  coupling 
coefficient  between  large-scale  and  small-scale 
solutions  [1,  2].  The  parameter  (3  controls  the 
large-scale/small-scale  interaction.  For  example,  in 
the  limit  /3  =  1,  the  effect  of  large  scale  changes  is 
transferred  to  the  small  scale,  but  not  vice  versa.  In 
principle,  (3  can  be  assigned  any  value  between  0 
and  1  implying  nonuniqueness  of  the  decomposi¬ 
tion,  but  Brown  et  al  [5]  have  recently  demon¬ 
strated  optimal  values  for  the  incompressible 
Navier-Stokes  equations.  In  the  present  study, 
(3  —  1  has  been  used. 

Decomposition  of  the  variables  in  the  recombi¬ 
nation  term  R  requires  a  special  treatment.  In  the 
Shockley-Read-Hall  expression  for  recombina¬ 
tion-generation,  n  and  p  appear  in  the  numerator 
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as  well  as  in  the  denominator.  This  makes  a 
straight  implementation  of  the  additive  decom¬ 
position  impossible.  One  possible  solution  to  this 
problem  is  to  calculate  the  value  of  the  denomi¬ 
nator  with  the  n  and  p  values  from  the  previous 
iteration.  Then,  treat  the  denominator  as  a 
constant  and  decompose  only  the  numerator, 
using  the  decomposition  parameter  /?,  which  will 
generate  both  R *  and  R.  Another  possible  method 
is  to  linearize  R  by  expanding  in  a  Frechet-Taylor 
series,  and  then  decompose  the  basic  variables  n 
and  p  of  the  linearization  in  the  usual  way.  In  the 
preliminary  tests  of  the  AD  method  presented 
here,  the  authors  used  the  first  method  to 
decompose  the  recombination  term.  It  is  impor¬ 
tant  to  note  that  the  decomposition  has  been 
designed  so  that  sums  of  the  large  and  small-scale 
equations  are  the  original  Eqs.  (1-3).  For  linear 
problems,  this  represents  a  straightforward  ap¬ 
proach,  the  validity  of  which  is  trivial  to  demon¬ 
strate.  For  problems  of  the  type  considered  here, 
the  proof  is  more  involved,  and  we  refer  the  reader 
to  Brown  et  al  [5]  for  an  indication  of  what  is 
needed. 

For  a  1-D  simulation,  each  large-scale  equation 
is  solved  on  the  large-scale  grid  with  N  points 
across  the  device  (N  is  quite  a  small  number),  and 
the  small-scale  equations  are  solved  on  the  small- 
scale  grid  over  a  section  of  the  device  centered 
around  a  large-scale  grid  point;  see  Figure  1 .  There 
is  a  separate  set  of  small-scale  equations  (for  small- 


N  large-scale  grid  points 

/  \ 

I - 1  ;  l  ;  I - 1 - 1 - 1 
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V 

small-scale  grid 
for  one  section 

FIGURE  1  Illustration  of  large-scale  and  small-scale  dis¬ 
cretization  grid. 


scale  space  or  time  points)  for  each  large-scale  grid 
point.  Thus,  the  small-scale  equations  can  be 
solved  independently  and  in  parallel  in  N  sections. 
The  large-scale  solution,  which  depends  on  the 
small-scale  solution,  combines  these  to  produce 
total  results  over  the  whole  simulation  domain. 

The  large-scale  function  may  be  defined  in 
various  ways  between  the  large-scale  grid  points. 
In  our  first  calculations,  the  large-scale  values  were 
set  constant  within  one  section,  equal  to  the  values 
obtained  from  the  large-scale  solution  for  the 
large-scale  point  of  the  section;  i.e.,  the  large-scale 
solution  representation  is  piecewise  constant  when 
used  in  the  small-scale  equations.  The  small-scale 
values  are  variations  of  the  exact  solution  around 
the  (local)  constant  large-scale  value  within  one 
section.  The  total  small-scale  function  along  the 
entire  device  length  is,  of  course,  not  continuous, 
although  the  derivative  of  the  small-scale  function 
is  the  same  as  the  derivative  of  the  total  solution 
where  both  are  well  defined.  We  also  tried  the 
approach  which  is  a  linear  interpolation  of  the 
large-scale  solution  between  the  large-scale  grid 
points.  In  this  case,  the  small-scale  function  is 
continuous  but  has  discontinuous  slope  between 
sections.  Nevertheless  the  convergence  of  the  final 
solution  was  much  worse  with  this  method  than 
with  the  first  one. 

Boundary  conditions  between  the  separate 
sections  pose  a  significant  problem,  particularly  if 
the  sections  are  computed  in  parallel.  Two  types  of 
boundary  conditions  have  been  tested,  viz.,  the 
Neumann  type  and  the  Dirichlet  type,  and  it  has 
appeared  that  properly  calculated  Dirichlet  condi¬ 
tions  result  in  more  stable  solutions. 


NUMERICAL  EXPERIMENTS 

As  a  first  test,  the  new  AD  technique  has  been 
applied  to  the  1-D  solution  of  the  drift-diffusion 
transport  equations  of  the  p-i-n  diode  for  reverse- 
bias  and  forward-bias  conditions.  The  test  p-i-n 
diode  structure  is  presented  in  Figure  2.  As  a 
computational  platform,  the  PASSC  semiconduc- 
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boring  sections  overlapping,  the  sections  included 
from  5  to  12  small-scale  grid  points  for  7  sections, 
and  15,  9  and  18  small-scale  grid  points  per  section 
in  the  case  of  3  sections.  After  the  large-scale 
solution  has  been  calculated,  the  small-scale  initial 
guess  values  are  updated  by  interpolation  using  the 
large-scale  solution.  Then  the  accurate  small-scale 
solutions  are  computed  in  separate  (parallel) 
sections  around  each  large-scale  point.  The  final 
results  for  the  forward  bias  VAK=  1.0  V,  calcu¬ 
lated  with  the  large  grid  of  N  =  5,  are  shown  in 
Figure  3.  These  results  are  very  close  to  the 
reference  results  obtained  with  the  standard  one- 
grid  method.  The  biggest  problem  in  this  approach 
is  setting  up  appropriate  boundary  conditions 
between  sections,  because  the  final  results  are  very 
sensitive  to  the  slightest  change  in  boundary 
conditions  of  each  section.  The  current  density 

FIGURE  2  Potential  distribution  for  zero  bias  and  doping 

profile  of  the  best  p-i-n  diode  structure.  t  u  j  u  -  notent  iai 


tor  simulation  program  [6]  has  been  used,  with 
extensive  physical  models  included  [7,  8].  In  the 
present  calculations,  the  decomposition  parameter 
/?  has  been  set  to  1.  The  distributions  of  electro¬ 
static  potential  <f b,  electron  concentration  n  and 
hole  concentration  p  have  been  calculated  as  basic 
results.  As  a  measure  of  quality  of  computed 
results,  the  space  charge  formation  has  been  used 
in  the  case  of  zero  bias  [9]  and  reverse  bias,  and  the 
distributions  of  «,  and  p  as  well  as  current 
density  in  the  case  of  forward  bias. 

First,  as  a  reference,  the  solution  with  the 
traditional  method  has  been  obtained,  using  a 
non-uniform  grid  with  40  points  in  one  dimension. 
Then,  a  large-scale  solution  has  been  computed  on 
N  grid  points  uniformly  distributed  along  the 
device.  In  the  present  study,  we  have  tried  N  =  9 
and  N  =  5.  Small-scale  results  have  been  calcu¬ 
lated  in  separate  sections  surrounding  each  large- 
scale  point,  except  the  border  ones,  which  resulted 
in  7  or  3  separate  sections  along  the  device, 
respectively.  With  the  boundary  points  of  neigh¬ 
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from  the  reference  results  for  VAJc  =  +  1.0  V  was 
134.7  A/cm2.  The  current  density  calculated  using 
only  the  large-scale  solution  was  168.4  A/cm2  for 
N=5  (error  25%),  and  135.8 A/cm2  for  N=9 
(error  0.8%),  and  the  values  were  uniform  along 
the  entire  device  length.  After  solving  the  small- 
scale  equations  in  each  section,  the  final  current 
density  values  obtained  with  the  AD  method  were 
slightly  different  in  different  sections  (within  10%), 
which  was  due  to  inaccurate  small-scale  boundary 
conditions  for  separate  sections.  Iterating  the 
small-scale  results  to  recalculate  small-scale  boun¬ 
dary  conditions  and  re-solving  small-scale  equa¬ 
tions  improved  the  uniformity  of  the  resulting 
current  density,  although  the  rate  of  improvement 
was  slow. 

In  a  similar  manner,  results  for  the  reverse  bias 
were  calculated.  As  mentioned  already,  in  this 
case,  the  space  charge  formation  has  been  used  as 
a  measure  of  accuracy  of  results.  Figure  4  shows 
the  net  charge  ( Q  =  p—n  +  Nd—  Na)  calculated  on  a 
non-uniform  single  grid  (40  points  in  1-D)  with  the 
traditional  method  for  the  reverse  bias  VAK 
=  —  1.0  V.  The  net  charge  distribution  calculated 
only  with  the  coarse  large-scale  grid  (N  =  9)  is 
presented  in  Figure  5a.  After  solving  the  small- 
scale  equations  in  separate  sections  and  adding  the 
large  and  small  components  according  to  Eq.  (4), 
the  resulting  space  charge  distribution  (Fig.  5b) 
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FIGURE  5  Net  charge  distributions  for  the  reverse  bias 
-1.0  V:  a)  results  calculated  using  only  the  large-scale  grid  with 
9  points  uniformly  distributed,  b)  results  obtained  using  AD 
method  after  adding  the  large-scale  and  small-scale  components 
(the  thicker,  circled  grid  lines  correspond  to  the  large-scale 
grid);  note  the  good  agreement  with  the  reference  results  in 
Figure  4. 
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FIGURE  4  Net  charge  (Q  =  p-n  +  Nd- Na)  for  the  reverse 
bias  VAK  —  -1.0V,  calculated  with  the  traditional  method  on 
one  non  uniform  discretization  grid. 


was  almost  identical  to  the  reference  space  charge 
distribution  obtained  with  the  traditional  method 
(Fig.  4).  While  the  current  density  for  the  reverse 
bias  calculated  using  only  the  coarse  large-scale 
grid  is  the  same  order  as  the  reference  solution 
(error  below  36%),  the  local  charge  distribution  is 
absolutely  missing  from  large-scale  solution 
(Fig.  5a),  and  is  updated  only  by  adding  the 
small-scale  components  (Fig.  5b). 

Here,  we  discuss  if  there  is  any  physical  meaning 
of  the  basically  arithmetic  operations  leading  to 
the  additive  decomposition  of  the  semiconductor 
model  into  large-scale  and  small-scale  compon- 
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ents.  The  method  was  originally  conceived  for  a 
more  accurate  description  of  turbulent  fluid  flow, 
where  the  two  types  of  phenomena  could  be 
observed:  large-scale  fluid  flow  in  the  main  stream, 
and  turbulences  which  might  be  perceived  as 
small-scale  fluctuations  over  the  large-scale  flow. 
Is  there  anything  similar  in  semiconductors?  The 
results  shown  above  suggest  that  the  current 
density  may  be  treated  as  a  “large-scale  value”  as 
it  can  be  calculated  with  a  reasonable  accuracy 
even  on  very  coarse  grids.  We  have  checked 
various  forward  bias  values;  the  error  of  large- 
scale  current  density  ranged  from  0.8%  for 
VAK=  +  1.0V  to  20%  for  VAK=  +2.0 V.  In  the 
1-D  case,  the  value  of  the  total  current  density 
should  be  uniform  and  equal  over  the  entire  length 
of  the  device,  which  again  shows  its  large-scale 
character.  On  the  other  hand,  results  in  Figure  5 
indicate  that  the  space  charge  shows  a  “small-scale 
character”,  as  it  can  be  only  calculated  using  a  fine 
small-scale  grid.  These  observations  actually  agree 
with  the  more  rigorous  analysis  of  the  basic 
semiconductor  equations  using  a  singular  pertur¬ 
bation  approach  [10].  There,  it  is  concluded  that  if 
the  Scharfetter-Gummel  discretizetion  [11]  of  the 
continuity  equations  is  used  (which  is  used  also  in 
our  calculations),  the  current  density  can  be 
calculated  accurately  even  on  a  coarse  discretiza¬ 
tion  grid,  virtually  ignoring  “layers”,  i.e.,  thin 
regions  of  rapid  variations  of  the  potential  and  the 
carrier  densities,  which  physically  correspond  to 
junctions  between  differently  doped  regions  of  the 
device.  However,  to  solve  the  continuity  equations 
and  Poisson’s  equation  accurately  in  the  neighbor¬ 
hood  of  junctions,  a  much  finer  grid  is  required. 
Therefore,  the  distributions  of  the  potential  and 
carrier  densities  (hence,  also  the  net  charge) 
indicate  a  “small-scale  character”  near  the  junc¬ 
tions.  The  presented  additive  decomposition  ap¬ 
proach  revealed  also  this  feature  of  the 
semiconductor  drift-diffusion  equations. 

In  both  cases,  i.e.,  forward  bias  and  reverse  bias 
of  the  p-i-n  diode,  the  additive  decomposition 
results  agreed  quite  well  with  the  results  obtained 
with  traditional  one-grid  approach,  while  the  AD 


method  allows  for  potentially  significant  reduction 
of  matrix  computation  and  computer  memory 
requirements.  Although  in  this  case  the  sections 
with  small-scale  grid  were  calculated  sequentially, 
the  code  has  been  written  as  if  the  sections  were 
processed  in  parallel.  Hence,  the  possible  parallel 
implementation  of  AD  has  been  verified.  Extensive 
parallelization  studies  have  been  conducted  for 
1-D,  2-D  and  3-D  Navier-Stokes  equations  solved 
with  this  general  approach  by  McDonough  and 
co-workers  [12— 14]. 

CONCLUSION 

The  main  advantage  of  the  new  method  is  that  the 
matrix  equations  for  sections  with  small-scale 
points  can  be  solved  independently,  which  allows 
for  parallel  computations  and  smaller  memory 
requirements.  Furthermore,  computations  for  few 
large-scale  points  are  very  fast.  After  the  large- 
scale  solution  is  computed  an  initial  guess  for  the 
small-scale  solution  may  be  updated  using  inter¬ 
polation  of  large-scale  solution,  which  also  leads 
to  more  efficient  computation. 

After  successful  implementation  and  testing  of 
the  AD  method  for  the  drift-diffusion  equations, 
the  authors  intend  to  apply  the  new  approach  to 
the  hydrodynamic  semiconductor  model,  for 
which  the  AD  technique  should  be  also  suitable. 
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High  field  transport  in  phosphor  materials  is  an  essential  element  of  thin  film 
electroluminescent  device  performance.  Due  to  the  high  accelerating  fields  in  these 
structures  (1-3  MV/cm),  a  complete  description  of  transport  under  high  field 
conditions  utilizing  information  on  the  full  band  structure  of  the  material  is  critical 
to  understand  the  light  emission  process  due  to  impact  excitation  of  luminescent 
impurities.  Here  we  investigate  the  role  of  band  structure  for  ZnS,  GaN,  and  SrS  based 
on  empirical  pseudopotential  calculations  to  study  its  effect  on  the  high  field  energy 
distribution  of  conduction  band  electrons. 


Keywords:  High  field  transport,  Monte  Carlo  method,  ZnS,  GaN,  SrS 


INTRODUCTION 

Over  the  past  ten  years,  thin  film  electrolumines¬ 
cent  (TFEL)  devices  have  become  an  important 
technology  for  flat-panel  display  applications  [1]. 
A  TFEL  device  essentially  consists  of  a  thin  layer 
of  a  phosphor  material  sandwiched  between  two 
insulating  layers  which  are  contacted  by  electro¬ 
des.  Wide  band  gap  semiconductors  such  as  ZnS 
and  SrS  heavily  doped  with  a  luminescent  impurity 
are  currently  the  most  important  commercial 
phosphor  materials.  Above  a  critical  potential 
applied  to  the  electrodes  of  a  ZnS  based  TFEL 


device,  electrons  are  injected  into  the  phosphor 
layer  from  deep  trap  states  at  the  phosphor- 
insulator  interface.  For  sufficiently  high  phosphor 
fields,  the  electrons  gain  kinetic  energy  large 
enough  to  initiate  impact  excitation  of  the 
luminescent  impurities  such  as  Mn2+  to  excited 
levels.  Visible  light  is  emitted  when  the  impurities 
return  back  to  the  ground  state.  SrS  doped  with 
Ce3+  has  recently  demonstrated  good  efficiency  as 
a  blue  phosphor  material  in  TFEL  applications, 
which  is  critical  to  realizing  full  color  (RGB) 
displays.  Typically,  the  turn  on  fields  are  higher 
than  those  of  ZnS  devices,  which  may  be 
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associated  with  the  high  field  transport  properties 
of  this  material.  GaN  is  another  wide  band  gap 
material  with  a  similar  band  structure  to  ZnS, 
which  is  currently  of  interest  for  high  temperature 
electronics  and  visible  optical  sources.  However, 
the  efficiency  of  GaN  phosphors  in  TFEL  devices 
is  poor,  which  is  not  understood  at  this  time. 

The  purpose  of  the  present  work  is  to  investigate 
the  relative  high  field  transport  properties  of 
zincblende-type  ZnS  and  GaN  as  well  as  SrS 
based  on  band  structure  features  by  Monte  Carlo 
methods.  In  order  to  better  understand  the 
operation  of  electroluminescent  devices,  it  is 
critical  to  know  to  which  extent  the  TFEL 
performance  is  determined  by  the  transport  in 
the  phosphor  material. 

MONTE  CARLO  MODEL 

The  ensemble  Monte  Carlo  code  presented  in  this 
paper  includes  the  full  band  structure  information 
of  the  phosphor  material  and  all  the  pertinent 
scattering  mechanisms  to  solve  the  semi-classical 
Boltzmann  transport  equation  under  high  field 
conditions.  Our  approach  basically  follows  the 
model  developed  by  the  Illinois  group,  which  is 
described  in  detail  elsewhere  [2]. 

The  standard  empirical  pseudopotential  method 
(EPM)  is  employed  to  obtain  the  band  structure 
for  ZnS,  GaN,  and  SrS.  In  the  case  of  ZnS,  we 
have  extended  the  local  EPM  calculation  reported 
by  Walter  and  Cohen  [3]  and  have  included  also 
nonlocal  corrections  in  the  pseudopotential  [4]. 
The  new  nonlocal  parameter  set  was  determined 
by  comparing  to  optical  data  for  this  material  [5]. 
For  GaN,  the  local  EPM  parameters  published  by 
Fan  et  al.  [6]  have  been  used  in  the  calculation. 
Their  form  factors  were  obtained  by  adjusting  the 
band  structure  results  along  directions  of  high 
symmetry  to  results  based  on  ab  initio  techniques 
and  to  experimental  data  for  band  gaps  and 
effective  masses.  In  the  case  of  SrS,  we  have 
derived  a  nonlocal  EPM  band  structure,  since  a 
reliable  EPM  calculation  is  not  yet  available  in  the 


literature.  As  a  starting  point,  we  have  utilized  the 
atomic  pseudopotentials  for  Sr  and  S,  and 
combined  them  using  the  method  suggested  by 
Harrison  [7]  to  give  a  first  estimate  of  the  band 
structure.  The  resulting  energy  dispersion  gives  a 
direct  gap  at  the  T  point.  Relying  on  various  first 
principles  calculations  [8,9],  SrS  is  believed  to  be 
an  indirect  semiconductor  with  an  indirect  gap 
close  to  4  eV  occuring  at  the  X  point.  We  have 
therefore  added  nonlocal  corrections  to  the  local 
form  factors  to  accomplish  the  transition  to  an 
indirect  band  gap  material.  Other  than  the  energy 
band  gaps  at  the  X  and  T  point  [10],  we  are  aware 
of  no  experimental  band  structure  data  for  this 
material.  Thus,  the  accuracy  of  the  calculated 
energy  dispersion  is  at  present  unknown.  Further¬ 
more,  the  number  of  181  plane  wave  states 
employed  in  the  expansion  of  the  crystal  wave 
functions  might  not  be  large  enough  in  the  case  of 
SrS  to  ensure  convergence  of  the  energy  levels  due 
to  its  pronounced  ionic  character  [11].  Figure  1 
shows  a  comparison  of  the  DOS  for  ZnS,  GaN, 
and  SrS.  As  can  be  seen,  the  DOS  for  SrS  is 
dramatically  larger  than  the  DOS  for  ZnS  and 
GaN. 

The  scattering  rates  for  various  mechanisms  in 
the  semiconductor  material  are  calculated  based 
on  first-order  time  dependent  perturbation  theory. 
For  energies  below  higher  valley  minima,  we  use  a 


FIGURE  1  Comparison  of  the  density  of  states  for  ZnS,  GaN 
and  SrS. 
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nonparabolic  band  model  to  compute  the  scatter¬ 
ing  rates  in  the  lowest  lying  valley  [12].  The 
effective  mass  and  nonparabolicity  parameter  of 
the  valley  entering  into  the  scattering  rates  were 
extracted  directly  from  the  EPM  calculation  of  the 
band  structure.  The  scattering  mechanisms  taken 
into  account  include  scattering  due  to  polar  optical 
phonons,  acoustic  and  optical  phonons  via  the 
deformation  potential  interaction  and  ionized 
impurities.  At  energies  above  higher  valley  thresh¬ 
olds,  electron  scattering  is  dominated  by  the 
optical  deformation  potential  mechanism.  To 
lowest  order,  this  mechanism  is  isotropic  in  nature 
and  depends  only  on  the  density  of  final  states. 
Therefore,  the  DOS  shown  in  Figure  1  is  used 
directly  to  determine  the  scattering  rate  at  high 
energy.  Reported  measurements  of  the  optical 
deformation  potentials  for  inter-  and  intravalley 
scattering  in  ZnS,  GaN,  or  SrS  do  not  exist  to  our 
knowledge.  For  this  reason,  we  treat  the  deforma¬ 
tion  potentials  as  adjustable  parameters  by  fitting 
to  carrier  multiplication  data,  when  available. 

An  inevitable  scattering  mechanism  for  the 
stabilization  of  the  electron  distribution  function 
at  high  electric  fields  is  band-to-band  impact 
ionization  of  electron-hole  pairs  by  energetic 
electrons.  Here,  we  use  an  energy  dependent 
impact  ionization  rate  averaged  over  all  directions 
in  wave  vector  space,  calculated  numerically  from 
the  full  band  structure  of  the  wide  band  gap 
semiconductor  and  well  represented  by  a  power 
law  [5],  [13].  In  our  Monte  Carlo  simulations,  this 
fit  formula  is  incorporated  into  the  collision  term 
of  the  Boltzmann  equation. 

RESULTS  AND  DISCUSSION 

In  the  ensemble  Monte  Carlo  simulations  pre¬ 
sented  here,  we  have  included  the  first  four 
conduction  bands  of  each  semiconductor  studied. 
The  full  band  structure  is  represented  using  916 
points  in  the  irreducible  wedge  of  the  first  Brillouin 
zone.  Typically,  an  ensemble  of  96,000  particles  is 
followed  in  order  to  achieve  sufficient  accuracy  in 


simulating  rare  events  such  as  impact  ionization 
and  impact  excitation. 

Although  systematic  experimental  studies  of 
high  field  transport  in  wide  band  gap  semiconduc¬ 
tors  such  as  ZnS,  GaN,  or  SrS  are  currently  not 
available,  information  regarding  the  high  field 
distribution  may  be  indirectly  deduced  from 
measurement  of  carrier  multiplication  due  to 
band-to-band  impact  ionization.  Such  measure¬ 
ments  were  performed  for  ZnS  by  Thompson  and 
Allen  [14].  We  have  used  a  set  of  two  optical 
deformation  potentials,  one  characterizing  the 
lowest  conduction  band,  and  the  other  characteriz¬ 
ing  the  upper  bands,  to  bring  our  simulated  results 
for  the  impact  ionization  coefficient  versus  field 
into  agreement  with  Thompson  and  Allen’s  data. 
The  details  of  this  choice  of  proper  deformation 
potentials  have  been  discussed  in  Reference  [5].  We 
are  not  aware  of  any  carrier  multiplication 
measurement  for  GaN  and  SrS.  Owing  to  this  lack 
of  information,  values  of  deformation  potentials 
identical  to  those  in  ZnS  have  been  assumed  in  the 
Monte  Carlo  simulations.  By  assuming  the  same 
deformation  potentials  for  each  material,  we  are 
able  to  make  a  comparison  of  the  high  field 
transport  properties  of  these  three  materials  based 
on  the  band  structure  itself. 

Figure  2  shows  the  simulated  steady-state 
particle  distribution  (i.e.,  the  number  of  particles 
in  a  given  energy  range)  in  ZnS,  GaN,  and  SrS  for 
an  electric  field  of  2  MV/cm  at  300  K.  For  Mn2+ 
impurities  in  ZnS  with  an  excitation  threshold 
energy  of  approximately  2.1  eV,  the  comparison  of 
ZnS  to  GaN  indicates  that  there  are  more 
electrons  above  the  impact  excitation  threshold 
due  to  the  lower  DOS  for  GaN,  and  significantly 
fewer  electrons  in  the  case  of  SrS  due  to  its  high 
DOS.  The  secondary  peak  in  the  distribution  for 
ZnS  at  about  4  eV  is  directly  related  to  the 
minimum  in  the  DOS  shown  in  Figure  1.  Since  a 
lower  DOS  translates  to  a  reduced  optical 
deformation  potential  scattering  rate,  the  trends 
in  the  DOS  are  clearly  reflected  in  the  simulated 
high  field  distributions,  provided  that  the  assump¬ 
tion  of  identical  deformation  potentials  is  not  too 
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FIGURE  2  Simulated  particle  distribution  in  ZnS,  GaN  and 
SrS  for  an  electric  field  of  2  MV/cm  at  a  lattice  temperature  of 
300  K  assuming  identical  deformation  potentials. 


unrealistic.  Our  results  seem  to  suggest  that  the 
high  field  properties  of  GaN  would  be  superior  to 
those  of  ZnS  in  TFEL  device  operation,  and  that 
SrS  has  less  favorable  transport  properties.  High 
field  transport  in  SrS  might  improve  at  higher 
fields  because  of  the  tendency  towards  reduced 
DOS  with  increasing  kinetic  energy,  as  can  be  seen 
from  Figure  1.  The  uncertainty  with  regard  to  the 
operation  of  a  SrS  device  is  how  much  of  a  role  is 
played  by  space  charge  and  how  large  is  the 
cathode  field  under  device  operation.  There  is 
experimental  evidence  that  the  cathode  field  may 
be  significantly  larger  than  the  measured,  average 
field  due  to  large  amounts  of  space  charge  [15]. 
These  effects  have  been  ignored  in  the  present 
work. 

Figure  3  shows  the  impact  excitation  yield  and 
impact  ionization  yield  for  electrons  traversing  a 
ZnS  layer  of  0.5  pm  thickness  for  various  electric 
fields.  The  impact  excitation  rate  of  Mn  lumines¬ 
cent  impurities  in  ZnS  used  in  the  Monte  Carlo 
simulations  has  been  calculated  from  the  transition 
rate  derived  by  Bringuier  [16]  based  on  the 
exchange  scattering  process.  For  Mn  impurities 
embedded  in  ZnS,  exchange  scattering  is  known  to 
be  the  dominant  excitation  path.  We  have 
performed  the  high  field  simulations  for  two 


FIGURE  3  Simulated  impact  ionization  and  impact  excita¬ 
tion  yields  for  various  electric  fields  in  a  ZnS  layer  of  0.5  pm 
thickness  doped  with  Mn.  Shown  are  the  results  for  two 
different  lattice  temperatures. 


different  lattice  temperatures:  7=300  K  and 
T=11K.  For  both  temperatures,  the  impact 
ionization  yield  is  larger  than  the  corresponding 
excitation  yield  for  a  density  of  10 20  cm3  Mn 
atoms.  The  comparison  reveals  that  impact 
excitation  as  well  as  impact  ionization  increases 
with  decreasing  lattice  temperature  since  the 
scattering  of  electrons  due  to  phonons  is  reduced. 
At  the  lower  temperature,  the  electron  system 
reaches  a  higher  average  energy  in  the  steady  state 
so  that  more  electrons  are  capable  to  impact  excite 
luminescent  impurities  or  to  generate  electron-hole 
pairs  by  band-to-band  impact  ionization. 

CONCLUSIONS 

We  have  simulated  high  field  transport  in  ZnS, 
GaN,  and  SrS  including  the  full  band  structure  of 
these  materials  in  a  Monte  Carlo  simulation.  The 
simulated  steady-state  distributions  of  electrons  at 
typical  phosphor  fields  reveal  good  transport 
properties  of  ZnS  and  GaN.  High  field  transport 
in  SrS  appears  to  be  less  favorable,  although  space 
charge  effects  might  be  very  important.  We  find 
good  agreement  in  the  expected  trends  of  the 
impact  excitation  yield  with  temperature  in  ZnS 
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based  TFEL  devices.  The  optical  deformation 
potential  is  the  greatest  uncertainty  in  the  present 
Monte  Carlo  model,  which  suggests  further 
investigation  of  the  electron-phonon  coupling  in 
phosphor  materials  is  needed  in  order  to  under¬ 
stand  their  high  field  behavior. 
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FDTD  and  Monte  Carlo  methods  are  combined  to  simulate  the  terahertz  radiation 
from  a  coplanar  photoconducting  structure.  The  simulation  tool  under  consideration 
allows  calculation  of  potentials,  particle  distributions,  current  densities,  and  the  near 
field  electromagnetic  fields  anywhere  in  the  computational  domain.  To  model  the  far 
field  radiation,  it  is  not  efficient  nor,  in  many  cases,  physically  possible  at  present  to  use 
the  FDTD  technique  directly  because  of  the  excessive  computational  burden. 
Techniques  are  discussed  for  modeling  both  the  near  field  and  the  far  field  radiation. 
Computational  results  showing  the  far  field  radiation  are  in  qualitative  agreement  with 
published  experimental  results. 


Keywords:  Terahertz  radiation,  subpicosecond  systems,  photo-conducting  systems,  FDTD,  time 
domain  analysis,  computational  modeling,  electromagnetic  radiation,  far-field  radiation,  near-to- 
far-field  transformation,  Hertzian  dipole 


INTRODUCTION 

Characterization  of  the  electromagnetic  radiation 
arising  from  ultrafast  electronic  structures  whose 
dimensions  approach  the  wavelength  of  operation 
is  a  subject  of  continued  research.  The  design  of 
terahertz  radiators  [1,  2]  as  well  as  applications 
such  as  electrooptic  sampling  [3],  switching  [4],  and 
terahertz  spectroscopy  [5]  all  require  an  accurate 
model  of  the  radiated  electromagnetic  field.  In 


some  simple  cases  the  radiated  field  may  be 
calculated  directly  or  approximated  from  a  known 
excitation.  Alternatively,  numerical  techniques 
such  as  the  finite-difference  time  domain  (FDTD) 
method  allow  computation  of  the  fields  radiated 
from  structures  with  arbitrary  physical  geometry 
or  material  composition.  For  an  accurate  portrait 
of  the  radiated  fields,  the  open  region  simulation 
space  used  in  the  FDTD  should  be  terminated 
with  absorbing  boundary  conditions  (ABCs). 
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ABCs  eliminate  reflections  at  the  edges  of  the 
computational  domain,  allowing  simulation  of 
propagation  in  infinite  free  space. 

The  computational  burden  of  the  FDTD 
method  increases  with  the  dimensions  of  the 
physical  simulation  space,  and  thus  becomes  less 
efficient  for  calculations  far  from  the  source.  The 
FDTD  also  suffers  from  numerical  grid  dispersion 
for  large  computational  domains.  Additionally, 
the  larger  the  simulation  space,  the  larger  the 
computational  burden  involved  in  the  ABC 
calculations.  It  is,  therefore,  desirable  to  use 
alternate  techniques  for  determination  of  the  far 
field  radiation. 


FORMULATION 
Simulation  Tool 

The  present  work  utilizes  a  simulation  tool  which 
has  been  developed  at  Oregon  State  University  [6], 
The  tool  couples  a  Monte  Carlo  particle  simulator 
to  either  a  Poisson  solver  or  an  FDTD  electro¬ 
magnetic  solver  to  determine  device  characteristics 
such  as  potential,  particle  distribution,  and  current 
density  anywhere  in  the  computational  domain.  In 
the  present  case,  the  FDTD  solver  is  used, 
enabling  determination  of  the  electromagnetic 
(EM)  fields  throughout  the  simulation  space.  The 
Monte  Carlo  simulator  and  the  FDTD  solver  are 
coupled  together  in  a  leap  frog  manner,  allowing 
continual  updates  of  both  the  particle  distribution 
and  the  EM  fields,  see  Figure  1. 

The  Monte  Carlo  particle  simulator  allows 
stochastic  solution  to  the  Boltzmann  transport 
equation  for  particle  motion.  Particle  motion  is 
modeled  as  a  series  of  free  flights  subject  to 
magnetic  and  electric  forces.  Motion  is  terminated 
by  instantaneous,  random  scattering  events.  Ran¬ 
dom  scattering  times  are  generated  using  a  random 
number  generator  and  the  calculated  quantum 
mechanical  scattering  cross-section.  Important 
scattering  mechanisms  for  GaAs  are  included  in 
a  three  band  model. 


FIGURE  1  Computer  algorithm  used  in  the  present  work. 

The  FDTD  scheme  used  in  the  present  work 
allows  modeling  of  complex,  three  dimensional 
structures.  Centered  differences  are  used  in  time  and 
space,  utilizing  the  Yee  cell  formulation  [7].  This 
technique  allows  broadband  response  predictions 
and  access  to  field  components  throughout  the 
computational  domain.  The  absorbing  boundary 
conditions  used  in  the  simulations  space  are 
Berenger’s  Perfectly  Matched  Layer  (PML)  [8]. 
Impedance  matching  to  a  highly  lossy  medium  is 
accomplished  by  utilizing  electric  and  magnetic  loss 
terms  to  attenuate  only  the  outgoing  portion  of  the 
wave.  In  the  PML,  absorption  is  theoretically 
independent  of  frequency  and  angle  of  incidence  [8]. 

Photo-Conducting  Experiment 

The  radiation  arising  from  a  photo-conducting 
system  is  modeled  using  the  simulation  tool 
described  above.  In  this  system,  a  sub-picosecond 
laser  pulse  incident  on  a  GaAs  substrate  creates 
electron-hole  pairs,  as  shown  in  Figure  2.  Biased 
electrodes  on  either  side  of  the  pulsed  area  cause 
the  electrons  and  holes  to  migrate  away  from  each 
other,  creating  a  dipole  moment.  A  time-varying 
current  density  is  induced  in  the  GaAs  substrate. 
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FIGURE  2  Typical  structure  for  an  electrooptic  sampling 
experiment. 


The  rise  time  of  the  current  pulse  is  extremely  fast, 
on  the  order  of  a  few  hundred  femtoseconds,  as 
illustrated  in  Figure  3a.  This  current  density  is  the 
source  of  the  EM  radiation  in  the  photo-conduct¬ 
ing  experiment.  Once  the  excitation  is  removed, 
the  biased  electrodes  continue  to  attract  the 
electrons  and  holes,  therefore  recombination 
occurs  over  a  timescale  much  longer  than  the 
simulation  time. 


FIGURE  3  (a)  Current  density  in  the  GaAs  substrate  for  an 
injection  rate  of  lei  5/cm3  and  bias  of  40  volts,  (b)  Numerically 
smoothed  time  derivative  of  the  current  pulse  shown  in  (a). 


RESULTS 


Figures  4a  and  5a  show  the  near  field  radiation 
along  a  surface  above  and  parallel  to  the  GaAs 
substrate  of  the  structure  shown  in  Figure  2.  These 
fields  were  calculated  using  the  simulation  tool 
described  above,  with  a  bias  of  40  volts  on  the 
metal  electrodes,  and  an  injection  rate  of  lei  5/cm3. 
In  this  simulation  it  was  assumed  that  the  laser 
pulse  had  a  Gaussian  wavefront  with  beam  radius 
standard  deviation  of  2  pm,  a  Gaussian-distrib- 
uted  energy  variation  between  1.42  eV  and  1.62  eV 
(larger  than  the  bandgap  of  GaAs),  and  a  pulse 
duration  of  approximately  20  fs. 

If  the  source  is  modeled  as  a  Hertzian  dipole, 
i.e.,  a  dipole  with  negligible  length,  the  radiation 
may  be  found  using  the  well-known  equations  for 
an  infinitesimal  dipole,  repeated,  for  convenience, 
below.  See  e.g.  [9],  for  the  derivation. 
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(/ 
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Here  R  is  the  distance  from  the  dipole  to  the  far 
field  observation  point,  J  is  the  volume  current 
density  induced  by  the  dipole,  and  0  is  measured 
off  the  dipole  axis.  Figures  4b  and  5b  show  the 
fields  from  an  ideal  dipole  for  comparison  to  that 
of  the  near  field  radiation  found  using  the  FDTD 
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FIGURE  4  E$:  (a)  Near  field  radiation  from  the  photo¬ 
conducting  structure  found  using  FDTD.  (b)  Near  field 
radiation  from  an  ideal  Hertzian  dipole. 


FIGURE  5  Er\  (a)  Near  field  radiation  from  the  photo¬ 
conducting  structure  found  using  FDTD.  (b)  Near  field 
radiation  from  an  ideal  Hertzian  dipole. 


simulation  in  Figures  4a  and  5a.  The  effects  of  the 
various  material  parameters  in  the  simulation 
space  may  be  seen  in  Figure  4a.  Specifically,  the 


GaAs/air  interface  causes  some  variation  in  the 
field  distribution  near  the  center  of  the  surface. 

To  model  the  far  field  radiation,  only  terms 
which  decrease  as  l/R  are  retained  from  Eq.  (1).  In 
this  case,  the  electric  field  is  given  by: 


Ee{R ,  0 


sin0  d 
47reoRc2dt 


(2) 


The  far  field  radiation  is  proportional  to  the 
time  derivative  of  the  excitation.  Eq.  (2)  is  often 
used  to  approximate  the  far  field  radiation  arising 
from  photoconducting  antennas  and  arrays  [1]. 
However,  (2)  assumes  a  dipole  source  of  negligible 
length  and  free  space  propagation.  The  use  of  (2) 
in  the  photo-conducting  experiment  also  neglects 
the  effects  of  the  GaAs  substrate  and  metal 
electrodes.  In  addition,  numerical  implementation 
of  the  time  differentiation  emphasizes  the  high 
frequency  noise  components.  Figures  3a  and  3b 
show  a  typical  current  pulse  and  two  numerically 
smoothed  versions  of  its  derivative  with  respect  to 
time.  Smoothing  is  accomplished  using  a  moving 
average  filter.  Figure  3b  compares  to  measured 
results  such  as  [10]. 

It  is  expected  that  a  more  accurate  representa¬ 
tion  of  the  far  field  radiation  would  be  given  with  a 
near-to-far  field  transformation.  This  technique, 
which  has  been  utilized  by  microwave  and  RF 
engineers  for  the  analysis  of  radiation  from,  for 
example,  horn  antennas  [11],  utilizes  an  equivalent 
source  methodology.  The  fields  found  on  a  closed 
virtual  surface  surrounding  the  source  are  used  as 
new  sources  of  radiation,  replacing  the  original 
source,  as  illustrated  in  Figure  6.  In  this  technique, 
the  near  field  radiation  arising  from  a  complex 
structure  may  be  calculated  using  the  FDTD 
method,  and  the  far  field  radiation  may  be  found 
using  the  equivalent  source  in  a  homogeneous 
external  problem  space.  Several  formulations  for 
an  equivalent  source  representation  exist,  inclu¬ 
ding  vector  forms  such  as  the  Schelkunoff  equiva¬ 
lence  theorem  [12],  the  Stratton-Chu  formulation 
[13],  the  Franz  formulation  [14],  and  the  scalar 
Kirchhoff  Surface  Integral  Representation,  de- 
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scribed,  for  example,  in  [15].  All  provide  a 
relationship  between  the  electromagnetic  fields  on 
a  surface  surrounding  a  source  and  the  field  at  a 
point  outside  the  surface.  The  field  produced  by 
the  equivalent  source  at  the  observation  point  will 
be  the  same  as  that  produced  by  the  original 
source.  The  Kirchhoff  formulation  would  be  the 
preferred  technique  in  modeling  the  photo-con¬ 
ducting  structure  because  the  technique  requires 
only  the  values  of  the  electric  fields  on  the 
equivalent  surface.  This  is  an  advantage  when 
used  with  the  FDTD  since  the  electric  and 
magnetic  fields  are  offset  spatially  by  one  half  cell 
[16,  17].  Additionally,  since  this  is  a  scalar 
formulation,  each  field  component  may  be  calcu¬ 
lated  independently,  allowing  placement  of  the 
virtual  surfaces  to  coincide  with  each  field 
component’s  placement  in  the  Yee  cell.  Applica¬ 
tion  of  this  technique  to  the  photoconducting 
structure  is  currently  being  implemented  by  the 
present  authors. 

CONCLUSION 

Techniques  for  describing  and  modeling  the 
terahertz  radiation  from  a  photo-conducting 
structure  have  been  discussed.  The  physical  source 
of  radiation  for  laser  excitation  on  a  GaAs 
substrate  was  described,  and  an  approximation 
of  the  source  in  terms  of  an  ideal  Hertzian  dipole 
was  presented.  A  simulation  tool  which  couples  a 


Monte  Carlo  particle  simulator  with  an  FDTD 
electromagnetic  solver  was  used  to  numerically 
determine  the  near  field  radiation.  Computational 
results  showing  near  field  radiation  patterns  were 
shown.  Techniques  for  approximating  the  far  field 
radiation  were  discussed.  Simulation  results  for  a 
technique  based  on  calculation  of  the  derivative  of 
the  excitation  were  presented. 
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New  “Irreducible  Wedge”  for  Scattering  Rate 
Calculations  in  Full-Zone  Monte  Carlo  Simulations 

JOHN  STANLEY*  and  NEIL  GOLDSMAN 
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To  facilitate  the  implementation  of  accurate  Brillouin  zone  integration  codes  for 
diamond-type  semiconductors  such  as  Si  we  have  constructed  a  new  “irreducible 
wedge”  to  be  used  in  conjunction  with  tetrahedral  fc-space  interpolations.  This  new 
wedge,  defined  by  the  vertices  (000),  (100),  (110)  and  (normalized  to  \X\  =  1), 
spans  portions  of  the  first  three  Brillouin  zones  and  contains  the  equivalent  of  1/48  of  a 
Brillouin  zone  volume.  Simple  symmetry  arguments  may  be  used  to  establish  the 
equivalency  of  this  new  wedge  with  the  standard  irreducible  wedge. 

Keywords:  Monte  Carlo,  Brillouin  zone  integration,  irreducible  wedge 


1.  INTRODUCTION 

Carrier  transport,  investigations  based  on  full-zone 
band  structure  require  accurate  and  efficient 
methods  of  computing  singular  integrals  over  the 
Brillouin  zone.  For  example,  in  Monte  Carlo 
simulations,  net  (integrated)  scattering  rates  and 
scattering  rate  densities  (in  fc-space)  are  needed  to 
determine  scattering  event  occurrences  and  post¬ 
scattering  states  for  carriers  [1].  Such  integrals 
arise  from  Fermi’s  Golden  rule  and  are  of  the 
general  form 

g(e(k'))= J  d3kM{k,k')6{e(k)-[£(k')+Ae{k,k')]}. 


In  particular,  if  Ae  ( k ,  kf)  =  0,  M(k ,  kf)=l/47r 3 
and  ef  =  e  ( k '),  g(el)  gives  density  of  states  [2], 
Integrals  of  this  form  usually  can  only  be 
evaluated  numerically,  particularly  if  one  is  using 
a  full-zone  band  structure  computation  for  the 
dispersion  relation  £  {k).  In  such  cases  one  usually 
replaces  e{k)f  M(k,  kr ),  and  Ae(A:,  kf)  with 
appropriate  piecewise  linear  (for  M(k ,  k!)  and 
A £(k,  k '),  often  piecewise  constant)  interpolants 
defined  on  a  regular  mesh  in  fc-space.  The  under¬ 
lying  fc-space  mesh,  {&„}„,  is  usually  cubic;  inter- 
polatory  elements  are  then  chosen  to  be  either 
cubes  [1],  or  tetrahedra  (each  cube  being  sub¬ 
divided  into  five  (equi-edge-length)  or  six  (equi- 
volume)  tetrahedra  [3].  Band  structure,  e(kn ), 
gradients,  Vic£(kn),  and  scattering  rates,  S(kn ), 
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are  then  pre-computed  and  stored  for  interpola¬ 
tion  during  Monte  Carlo  execution.  Since  the  e  (k) 
relation  possesses  the  (translation  and  point 
group)  symmetry  of  the  crystal,  rather  than  store 
data  at  mesh  points  throughout  the  entire  first 
Brillouin  zone,  it  is  often  expedient  to  use  a  much 
smaller  region  of  k  space  known  as  an  “irreducible 
wedge”.  For  Si,  the  irreducible  wedge  is  1/48  of  the 
Brillouin  zone  volume  and  e  (k)  for  any  k  may  be 
computed  as  e(&/),  where  kj  is  a  symmetry- 
equivalent  wavevector  in  the  irreducible  wedge. 

Density  of  states  and  scattering  rate  integrals  are 
thus  reduced  to  sums  of  element  integrals  over  the 
irreducible  wedge,  each  element  integral  being 
relatively  simple  to  evaluate.  However,  for  the 
standard  irreducible  wedge  in  common  use  (des¬ 
cribed  below),  element  integrals  on  some  boundary 
planes  and  edges  are  cumbersome  to  evaluate.  This 
occurs  because  the  standard  irreducible  wedge 
cannot  be  precisely  filled  using  a  regular1  cubic  or 
tetrahedral  partition;  along  some  boundaries  there 
inevitably  will  be  “overhang”  of  elements  outside 
the  irreducible  wedge.  In  diamond-type  semicon¬ 
ductors,  elements  intersecting  fill]  faces  of  the 
Brillouin  zone  fall  into  this  category  so  that  special 
care  must  be  exercised  to  ensure  accurate  treat¬ 
ment  of  these  contributions  to  Brillouin  zone 
integrals. 


FIGURE  1  Standard  irreducible  wedge  for  Si 


2.  NEW  IRREDUCIBLE  WEDGE 

We  have  constructed  a  new  irreducible  wedge  for 
use  in  diamond-type  materials,  which,  compared 
with  the  standard  irreducible  wedge  in  common 
use  (Fig.  1)  considerably  simplifies  the  handling  of 
boundary  element  contributions  to  Brillouin  zone 
integrals.  This  is  especially  true  when  the  e{k) 
argument  of  the  delta  function  is  treated  to 
quadratic  order.  Our  irreducible  wedge  is  depicted 
in  Figure  2.  Notice  that  the  standard  irreducible 
wedge  lies  entirely  in  the  first  Brillouin  but  has  a 


FIGURE  2  New  irreducible  wedge. 


By  “regular”  we  mean  identical  up  to  translation  and  possibly  a  rotation. 
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somewhat  awkward  geometry,  while  the  irreduci¬ 
ble  wedge  shown  in  Figure  2  (a  tetrahedron  of  the 
same  volume  as  the  standard  wedge)  has  simpler 
geometry  but  contains  contributions  from  the  first 
three  Brillouin  zones. 

In  contrast  to  the  standard  irreducible  wedge, 
this  new  wedge  allows  a  regular  tetrahedral 
discretization  in  which  there  is  essentially  no 
distinction  between  boundary  element  and  interior 
element  contributions  to  integrals.  That  is,  the  new 
wedge  may  be  precisely  filled  with  a  regular  array 
of  tetrahedral  elements,  while  for  the  standard 
wedge,  one  inevitably  finds  elements  intersecting 
the  LUWK  boundary  plane  to  have  portions 
extending  outside  the  wedge.  The  presence  of  such 
“overhang”,  while  posing  no  significant  difficulties 
for  element  interpolations,  can  have  a  troublesome 
impact  on  the  evaluation  of  integrals.  The  reason 
for  this  is  as  follows.  Assuming  a  tetrahedral 
interpolation  scheme,  first  note  that  evaluating 
interior  element  contributions  to  Brillouin  zone 
integrals  is  relatively  straight-forward  (at  least  for 
the  case  where  energy  is  interpolated  only  to  linear 
order);  tetrahedral  element  integrals  reduce  to 
simple  surface  integrals  over  planar  triangles 
contained  within  the  element.  The  difficulty  for 
boundary  elements  not  conforming  to  the  wedge 
boundary  arises  from  the  need  to  include  only  that 
portion  of  the  (constant  energy  or  post-scattering 
state)  surface  which  lies  within  the  tetrahedron  and 
also  within  the  wedge.  Clearly,  no  such  difficulties 
occur  for  interior  elements  or  for  boundary 
elements  conforming  to  the  wedge  boundary. 

To  illustrate  the  construction  of  this  new 
irreducible  wedge,  refer  to  Figures  3-6.  In  Fig¬ 
ure  3  we  show  an  octahedron,  which,  if  the  vertices 
are  sliced  off  by  [100]  planes,  gives  the  first 
Brillouin  zone  (Fig.  4).  The  surface  of  the  second 
Brillouin  zone  is  shown  in  Figure  5.  Note  that  both 
the  first  and  second  Brillouin  zones  are  defined 
entirely  by  [100]  and  [^5]  planes. 

An  examination  of  the  second  Brillouin  zone 
reveals,  by  translation  symmetry,  that  the  portions 
of  the  octahedron  vertices  extending  beyond  the 
(100)  points  may  equivalently  be  subdivided  into 


FIGURE  3  An  octahedron. 


FIGURE  4  1st  Brillouin  zone  of  fee  lattice. 


four  identical  tetrahedrons  and  moved  pairwise 
from  the  (100)  axes  to  fill  the  “notches”  present  at 
the  surface  along  (110)  directions.  In  carrying  out 
this  manipulation  one  converts  the  second  Bril¬ 
louin  zone  into  a  cube  without  changing  the 
volume  enclosed.  The  legitimacy  of  this  transfor¬ 
mation  is  readily  established  by  appealing  to  the 
underlying  space  group  symmetry.  If  we  now 
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FIGURE  5  Surface  of  2nd  Brillouin  zone  of  fee  lattice. 


FIGURE  6  Wedge  of  modified  2nd  Brillouin  zone. 


extend  the  standard  irreducible  wedge  to  pierce  the 
faces  of  this  cube,  we  obtain  the  right-tetrahedron 
shown  in  Figure  6.  This  irreducible  wedge  is 
exactly  twice  the  volume  of  the  standard  irreduci¬ 
ble  wedge,  containing  1/48  of  a  zone  in  the  first 


FIGURE  7  Density  of  states  in  Si  computed  using  new 
irreducible  wedge. 


Brillouin  zone  and  (the  equivalent  of)  1/48’th  of  a 
zone  in  the  second  and  third  Brillouin  zones. 

To  reduce  this  tetrahedron  to  an  equivalent 
irreducible  wedge,  we  again  appeal  to  symmetry. 
The  tetrahedron  shown  in  Figure  6  may  be  halved 
by  cleaving  along  a  plane;  the  two  halves  are 
identical  by  virtue  of  (diamond-type)  reflection 
symmetry  in  [111]  planes.  We  thus  arrive  at  the 
new  irreducible  wedge  shown  in  Figure  2  . 

3.  DENSITY  OF  STATES  CALCULATION 

To  illustrate  the  quality  of  integration  provided  by 
the  new  wedge,  Figure  7  shows  the  results  of  a 
density  of  states  calculation  for  the  lowest  ten  (4 
valence  plus  6  conduction)  bands  in  Si  using  a  local, 
113  plane-wave  pseudopotential  calculation  [4] 
(without  spin-orbit  coupling  included).  The  mesh 
used  consists  of  4000  tetrahedrons  (946  distinct 
vertices)  filling  the  new  irreducible  wedge.  With  a 
denoting  lattice  constant,  this  corresponds  to  0.05  x 
(27 r/a)  increments  for  mesh  points  along  (100) 
directions.  By  using  a  regular  tetrahedral  discretiza¬ 
tion  of  the  new  wedge,  one  obtains  remarkably 
smooth  results  with  relatively  little  coding  effort. 
For  the  present  calculation  the  e  (k)  argument  of  the 
delta  function  is  treated  to  linear  order  leading  to 
“wiggles”  near  van  Hove  singularities  [5]. 

4.  CONCLUSIONS 

We  have  defined  a  new  1/48  Brillouin  zone 
irreducible  wedge  applicable  to  diamond-type 
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semiconductors.  This  new  irreducible  wedge  facil¬ 
itates  accurate  numerical  evaluation  of  Brillouin 
zone  integrals  such  as  density  of  states  and 
scattering  rates  by  simplifying  considerably  the 
handling  of  boundary  element  contributions  to 
such  integrals. 
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A  self-consistent  numerical  simulation  model  for  a  pin  single-cell  solar  cell  is  formulated. 
The  solar  cell  device  consists  of  a  />-AlGaAs  region,  an  intrinsic  /-AlGaAs/GaAs 
region  with  several  quantum  wells,  and  a  w-AlGaAs  region.  Our  simulator  solves  a 
field-dependent  Schrodinger  equation  self-consistently  with  Poisson  and  drift-diffusion 
equations.  The  field-dependent  Schrodinger  equation  is  solved  using  the  transfer  matrix 
method.  The  eigenfunctions  and  eigenenergies  obtained  are  used  to  calculate  the  escape 
rate  of  carriers  from  the  quantum  wells,  the  capture  rates  of  carriers  by  the  wells,  the 
absorption  spectra  in  the  wells,  and  the  non-radiative  recombination  rates  of  carriers  in 
the  quantum  wells.  These  rates  are  then  used  in  a  self-consistent  finite-difference 
numerical  Poisson-drift-diffusion  solver.  We  believe  this  is  the  first  such  comprehensive 
model  ever  reported. 


Keywords:  Quantum  well,  solar  cell,  Schrodinger,  escape,  capture,  recombination 


1.  INTRODUCTION 

The  conversion  efficiency  of  a  single  cell  pin  solar 
cell  can  be  enhanced  by  incorporating  quantum 
wells  in  the  intrinsic  region  of  the  device.  [1]  The 
incorporation  of  the  quantum  wells  has  two 
counteracting  effects:  the  short-circuit  current  is 
increased  because  of  the  additional  absorption  of 
the  low-energy  photons  in  the  lower  bandgap 
quantum  well  and  the  open-circuit  voltage  is 
decreased  because  of  the  increase  in  the  recombi¬ 
nation  of  the  photoexcited  carriers  trapped  in  the 


quantum  well.  Experimental  results  have  shown, 
nevertheless,  that  the  additional  photocurrent 
resulting  from  the  extension  of  the  absorption 
spectrum  to  lower  energies  can  outweigh  the 
accompanying  drop  in  the  open-circuit  voltage 

[2-3]. 

Along  with  these  experimental  studies,  a  number 
of  theoretical  investigations  have  been  performed. 
Corkish  and  Green  [4]  studied  the  effects  of 
recombination  of  carriers  in  the  quantum  well 
and  concluded  that  although  the  increased  recom¬ 
bination  reduces  the  open-circuit  voltage,  limited 
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enhancement  in  the  conversion  efficiency  can  be 
obtained  with  incorporation  of  the  quantum  well, 
albeit  not  as  much  as  previously  reported  by 
Barnham  and  Duggan  [1].  Araujo  et  ah  [5]  used 
detailed  balance  theory  and  predicted  that  the 
conversion  efficiency  of  the  quantum  well  cell 
would  not  exceed  that  of  the  base-line  device.  The 
results  of  photoresponse  calculations  by  Renaud 
et  al.  [6]  revealed  that  introducing  the  quantum 
wells  can  lead  to  improved  photocurrent  without 
much  degradation  of  the  open-circuit  voltage. 
Most  recently,  Anderson  [7]  presented  an  ideal 
model  for  the  quantum  well  solar  cell  device, 
incorporating  the  recombination  and  generation  in 
the  quantum  wells.  Anderson  concluded  that  the 
improvement  in  efficiency  is  achieved  only  when 
the  depth  of  the  quantum  well  is  less  than  about 
200  meV. 

The  need  for  a  comprehensive  model  is  rather 
obvious,  now  that  there  seems  to  be  an  unsolved 
debate  as  to  the  ultimate  advantage  of  incorporat¬ 
ing  quantum  wells  in  the  intrinsic  region  of  a  pin 
solar  cell.  In  this  paper  we  present  formulation  of 
one  such  model  in  which  we  self-consistently 
include  the  effects  of: 

1)  Capture  of  electrons  by  the  wells, 

2)  Escape  of  electrons  from  the  wells, 

3)  Absorption  of  light  in  the  wells,  and 

4)  Recombination  of  carriers  in  the  wells. 

The  standard  drift-diffusion  equations  are  mod¬ 
ified  to  account  for  generation  and  recombination 
in  the  quantum  wells  and  the  transfer  of  electrons 
and  holes  between  the  bulk  and  quantum  well 
systems. 


2.  SELF-CONSISTENT  MODEL 


The  steady-state  transport  of  carriers  in  the  pin 
structure  is  described  by  current  continuity  equa¬ 
tion  written  for  the  bulk  regions  as: 


^  =  Gl-Ub  +  ^.Ht  +  lJJi  =  0,  (1) 

at  r”  t”  q  dx 


where  nb  and  nw  are  electron  densities  in  the  bulk 
and  quantum  wells,  respectively.  Jn  is  electron 
current  density.  The  two  additional  terms  to  the 
standard  bulk  current  continuity  equation  incor¬ 
porate  the  effect  of  carrier  transfer  into  and  out  of 
the  quantum  wells.  The  terms  nw/r  ”  and  nb/r "  are 
the  electrons  escape  and  capture  rates,  respec¬ 
tively.  These  rates  are  also  used  in  the  continuity 
equations  for  the  quantum  wells,  as  given  by: 


dnw 

dt 


nb 


nw 

Te 


+  Gw  —  Uw  —  0. 


(2) 


In  the  above  equations  r"  and  r"  are  the 
electrons  escape  and  capture  times,  respectively. 
The  escape  times  of  carriers  are  calculated  using 
the  model  reported  by  Moss  et  al.  [8]  and  capture 
times  are  extrapolated  from  theoretical  and  experi¬ 
mental  data  reported  by  Blom  et  al.  [9].  Similar 
equations  are  written  for  holes.  The  boundary 
conditions  for  continuity  equations  are  derived 
from  surface  recombination  velocity  model. 

In  the  above  continuity  equations,  the  recombi¬ 
nation  in  the  bulk  is  modeled  with  radiative  and 
non-radiative  mechanisms.  The  term  Gb  is  bulk 
generation  rate  and  is  given  by: 


where  Xc  is  set  to  correspond  to  the  bandgap  of  the 
material.  The  recombination  in  the  quantum  well 
Uw  is  a  modified  Shockley-Read-Hall  recombina¬ 
tion  rate  given  by: 


TJ  <?n apVthNtjpn  -p<m] 

cr„{n  +  nt]  +  ap[p  +  pt] 

where  the  trap  density,  Nt,  is  derived  from  the 
density  of  the  interface  states.  The  generation  term 
Gw  is  calculated  from  Eq.  (3)  with  the  bulk 
absorption  coefficient  replaced  with  that  of  the 
quantum  well.  The  absorption  coefficients  of  the 
quantum  wells  are  calculated  by  a  model  reported 
by  Stevens  et  al.  [10]. 
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Calculation  of  the  absorption  spectra  in  the 
quantum  wells  as  well  as  the  escape  and  capture 
times  of  carriers  require  the  eigenfunctions  and 
eigenenergies  of  the  carriers  in  the  quantum  wells, 
which  are  obtained  from  a  field-dependent 
Schrodinger  equation  given  by: 


3.  RESULTS 

The  energy  band  diagram  of  the  pin  solar  cell 
device  with  four  quantum  wells  in  the  intrinsic 
region  is  shown  in  Figure  L  We  simulated  five  pin 
devices  all  with  a  acceptor  doping  level  of  1018/cc 


~h2  d  1  d 
2  dxm*(x)  dx 


+  V{x) 


i;(x)  =  Eiip{x) 


(5) 


where  ^(x)  is  the  envelope  function,  Et  are  the 
eigenenergies  and  V(x)  is  the  potential  profile. 
Non-constant  effective  mass  m*{x)  is  assumed. 
The  Schrodinger  equation  is  solved  using  the 
transfer  matrix  method  [11].  The  above  equations, 
together  with  Poisson  and  drift-diffusion  equa¬ 
tions  are  solved  using  a  finite  difference  scheme. 


FIGURE  1  Energy  band  diagram  of  the  solar  cell  with  four 
quantum  wells. 


Dark  Current- Voltage  Characteristics  of  MOW  pin  AIGaAs  Diodes 


FIGURE  2  Dark  characteristics  of  MQW  pin  solar  cells. 


422 


S.  RAMEY  AND  R.  KHOIE 


and  donor  doping  level  of  4  x  1017/cc  in  the  0.2 
pm  ^-region  and  0.3  pm  w-region,  respectively.  The 
intrinsic  region  of  all  five  devices  is  0.5  pm.  The 
first  device  is  a  base-line  GaAs,  the  second  device 
is  a  base-line  AlGaAs  device  and  the  remaining 
three  devices  are  AlGaAs  devices  with  1,  10  and  20 
quantum  wells,  respectively.  The  mole  fraction  of 
A1  is  0.3  in  all  AlGaAs  devices  and  all  quantum 
wells  are  100  A  wide. 

We  simulated  the  dark  characteristics  of  all  five 
cells  and  the  results  are  shown  in  Figure  2.  Because 
the  model  incorporates  both  radiative  and  non- 
radiative  recombination  for  the  bulk  material,  the 
slope  of  the  dark  current  is  qv/nkT ,  with  the 
ideality  factor  n  ranging  from  one  for  exclusively 
radiative  recombination  to  two  for  exclusively 
non-radiative.  In  the  pin  devices  without  quantum 
wells,  n  approaches  one  at  high  forward  bias  where 
radiative  recombination  becomes  more  important. 
At  low  biases  the  non-radiative  recombination 
dominates  and  n  is  1.9  for  the  AlGaAs  device  at 
0.1  volts  applied. 

The  recombination  in  the  quantum  wells  is 
modeled  as  non-radiative  interface  recombination 
using  a  modified  Shockley-Read-Hall  expression. 
With  the  introduction  of  quantum  wells  into  the 
pin  device,  this  non-radiative  recombination  dom¬ 
inates  at  all  biases  and  the  ideality  factor  equals 
about  1.7  for  all  the  quantum  well  devices.  The 
observed  crossover  of  the  i—v  curves  of  the 
quantum  well  devices  and  the  GaAs  device  has 
also  been  reported  by  Ragay  et  al.  [3]. 
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A  quantum  mechanical  treatment  of  electron  inversion  layers  is  incorporated  in  the 
hydrodynamic  (HD)  transport  model  used  in  UT-MiniMOS.  A  physically  based,  yet 
computationally  efficient,  three-subband  model  is  implemented  in  the  HD  simulation 
tool.  The  three-subband  model,  which  is  based  upon  solutions  to  Schrodinger’s 
equation,  has  the  important  advantage  of  more  accurately  predicting  the  distribution  of 
electrons  in  the  inversion  layers  than  does  more  conventional  classical  models.  A  more 
simplified  quantum  mechanical  model  with  carrier  heating  effects  included  has  also  been 
developed.  Terminal  currents  are  calculated  using  these  quantum  mechanical  models 
and  the  comparison  with  results  from  classical  calculations  indicates  the  importance  of 
quantum  mechanical  effects  in  the  deep  submicron  device  simulations. 


Keywords :  MOS,  inversion  layer,  quantization,  hydrodynamic  simulation,  carrier  confinement 


INTRODUCTION 

The  quantum  mechanical  treatment  of  carrier 
confinement  in  silicon  MOSFET  inversion  layers 
is  increasingly  essential  for  accurate  simulation  of 
devices  scaled  into  the  deep  submicron  region.  It  is 
well  known  that  a  high  transverse  electric  field  at 
the  Si/Si02  interface  leads  to  a  large  energy  band 
bending,  which  in  turn  can  cause  quantization  of 
the  inversion  layer  carriers  in  the  direction  per¬ 


pendicular  to  the  interface  [1].  Self-consistent 
solutions  of  the  Schrodinger’s  and  Poisson  equa¬ 
tions  have  been  adopted  by  a  number  of  research¬ 
ers  to  investigate  the  two-dimensional  nature  of 
electrons  in  inversion  layers  [2, 3].  However,  from 
the  point  of  view  of  the  device  design  engineer,  less 
computationally  expensive  approaches  that  may 
be  routinely  used  with  existing  device  simulators 
are  necessary.  Van  Dort’s  simple  quantum  model 
[4]  and  three  subband  quantization  model  (3SB)  [5] 
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have  been  implemented  in  drift-diffusion  simula¬ 
tion  tools  and  have  facilitated  the  prediction  of 
device  electrical  behavior  (such  as  threshold 
voltage  and  capacitance). 

The  three  subband  model  introduces  an  addi¬ 
tional  level  of  physical  sophistication  by  solving 
for  the  energy-level  and  wave-function  for  the  first 
three  subbands,  using  approximate  solutions  to 
Schrodinger’s  equation  based  upon  physically 
reasonable  estimates  of  the  form  of  the  potential 
well.  This  approach  has  the  advantage  of  correctly 
predicting  the  electron  distribution  in  the  inversion 
layers  due  to  quantum  mechanical  effects  and,  in 
turn,  leads  to  better  predictive  capabilities  for  C- 
V  simulation  and  physical  oxide  thickness  estima¬ 
tion.  In  the  hydrodynamic  (HD)  3-subband 
simulation,  carrier  temperature  instead  of  lattice 
temperature  is  used  to  distribute  electrons  among 
each  subband  and  between  the  classical  (three 
dimensional  electron  gas)  and  the  quantum  (two 
dimensional  electron  gas)  domains.  As  a  result, 
under  high  drain  bias  condition,  a  large  portion  of 
electrons  near  the  drain  edge  will  be  in  the  classical 
domain  due  to  the  carrier  heating  effect.  In  this 
paper,  we  present,  for  the  first  time,  HD  simula¬ 
tion  results  with  a  3SB  model  included  using  UT- 
MiniMOS  [6]. 

CARRIER  CONFINEMENT  AND 
QUANTIZATION  MODEL 
IMPLEMENTATION 

When  carriers  are  quantized  in  the  potential  well, 
the  density  of  states  must  be  represented  by  a  two- 
dimensional  (2-D)  density  of  states  instead  of  the 
three-dimensional  (3-D)  density  of  states  that  is 
assumed  in  classical  calculations.  Knowing  the 
eigenvalue  and  eigenfunction  for  each  subband,  we 
can  represent  the  distribution  of  charge  in  a  2-D 
system  by  the  following  expression: 

M  ^ ^carrier 

WQM(Z)  = - -3 —2LJlvimzi 

tv n  i 


where  nvi  denotes  the  degeneracy  of  the  zth 
subband,  Ey  the  subband  energy  level  of  the  yth 
valley  in  the  ith  subband,  Ec]  the  energy  level  of  the 
classical  domain,  and  ^{z)  the  wavefunction 
describing  the  distribution  of  carriers  in  the 
subband.  In  the  three  subband  model,  Etj  and  (y 
are  calculated  via  self-consistent  solutions  of 
Schrodinger’s  equation  and  Poisson’s  equation. 
However,  a  priori  assumptions  about  the  shape  of 
the  potential  well  are  used  to  solve  (1)  based  upon 
analytical  expressions  for  Ey  and  (y.  A  triangular 
potential  well  is  used  in  the  three  subband  model. 
For  this  form  of  the  potential,  the  solution  to 
Schrodinger’s  equation  is  the  Airy  function  where 
the  eigenenergies  for  the  system  can  be  calculated 
exactly  as  a  function  of  the  surface  field  [2,  5].  To 
account  for  the  perturbation  of  the  potential  due 
to  the  inversion  layer  charges,  a  variational 
wavefunction  originally  proposed  by  Fang  and 
Howard  [7]  was  used  for  the  lowest  subband.  This 
bypasses  the  computational  expense  of  solving 
Schrodinger’s  equation  explicitly.  In  order  to 
maintain  additional  computational  efficiency,  only 
the  first  three  subbands  are  used  in  the  calculation 
of  (1).  Above  the  energy  level  Ech  the  carriers  are 
all  treated  as  classical  carriers  with  a  3-D  density 
of  states.  This  approach  ensures  agreement  with 
classical  calculations  in  the  limit  of  low  channel 
doping  in  addition  to  reducing  the  total  computa¬ 
tion  time. 

Since  the  electron  temperature  (or  average 
energy)  is  used  as  an  additional  variable  in  the 
HD  simulation,  the  temperature  effect  must  be 
included  in  the  quasi-Fermi  level  calculation  in 
order  to  obtain  a  correct  carrier  distribution 
among  subbands.  Also,  due  to  the  electron 
heating,  the  classical  carrier  component  may  play 
an  important  role  in  the  carrier  density  calculation 
for  the  region  near  the  drain  edge.  The  quasi- 


HD  SIMULATION  OF  NMOSFETs 


425 


Fermi  level  and  the  classical  carrier  component 
need  to  be  calculated  correctly  in  the  3  SB  model  to 
ensure  that  the  HD  simulation  converges  and  that 
the  simulation  results  are  physically  correct.  In  our 
approach,  the  fractional  electron  population  in  the 
quantum  domain  as  a  function  of  the  electron 
energy  is  modeled  using  our  quantum-window 
enhanced  uniform-field  Monte  Carlo  (MC)  simu¬ 
lator  [8],  as  shown  in  Figure  1.  The  fractional 
population-energy  relationship  in  our  HD  simula¬ 
tion  is  obtained  from  Monte  Carlo  simulation 
under  the  low  transverse  effective  field  condition 
(£trans=  1-9  x  lO5  V/cm).  Very  little  impact  on  the 
terminal  current,  even  under  the  high  drain  bias 
condition,  is  observed  by  choosing  a  different 
fractional  population-energy  relationship  from 
Monte  Carlo  simulations  with  different  transverse 
effective  fields.  The  reasons  for  the  insensibility  of 
terminal  current  on  the  fractional  population- 
energy  relationship  are  being  under  investigations. 
The  fractional  population,  so  modeled,  can  then 
be  used  to  calculate  the  sheet  quantum  carrier 
concentration  (Nqm)  and  classical  carrier  concen¬ 
tration  (Nd)  for  a  given  total  sheet  carrier 
concentration  (7Vqm  +  Nc\)  after  the  drift-diffusion 
solution  is  obtained.  By  knowing  Nqm  and  Nc\,  we 
can  calculate  the  quasi-Fermi  level  and  classical 


Electron  Energy  (eV) 

FIGURE  1  Fractional  population  of  electrons  in  the  quan¬ 
tum  domain  (first  three  subbands)  as  a  function  of  the  electron 
average  energy  from  our  uniform-field  Monte  Carlo  simulation. 
A  simple  analytic  fit  used  in  the  hydrodynamic  simulation  is 
also  plotted. 


effective  density  of  states.  The  carrier  distribution 
along  the  depth  direction  is  subsequently  calcu¬ 
lated  in  the  3 SB  model  to  obtain  the  effective 
intrinsic  carrier  density,  following  the  similar 
approach  of  Ref.  [5]. 

In  addition  to  the  more  physically  accurate 
three-subband  model,  more  simplified  models  are 
of  interest  to  the  modeling  community.  While 
these  models  do  not  attempt  to  predict  the 
quantum  mechanically  consistent  charge  distribu¬ 
tion,  they  are  able  to  predict  the  effect  of 
quantization  on  the  terminal  characteristics  when 
this  is  the  only  information  required.  This  is 
accomplished  by  altering  the  bandgap  of  the 
silicon  near  the  interface  via  the  intrinsic  carrier 
concentration  [4].  The  change  in  the  bandgap  is 
modeled  as  follows: 

(2) 

where  Es  denotes  the  transverse  electric  field  and  / 3 
the  fitting  parameter  used  to  calibrate  the  model  to 
experimental  data  and/or  more  physically  based 
quantization  models.  This  modified  bandgap  is 
then  used  to  make  corrections  to  the  charge 
density  near  the  interface.  The  carrier  temperature 
is  used  when  the  HD  models  are  turned  on, 
otherwise  the  lattice  temperature  is  used.  This 
modification  permits  the  model  to  approach  the 
classical  results  when  the  energy  distribution  of  the 
carriers  is  highly  energetic,  and  hence  more 
classical-like. 


SIMULATION  RESULTS  AND  DISCUSSION 

In  order  to  explore  the  importance  of  including 
carrier  confinement  models,  as  well  as  carrier 
heating  models,  in  device  simulation  tools,  we  have 
examined  the  behavior  of  the  drift-diffusion  and 
hydrodynamic  transport  models  both  with  quan¬ 
tum  mechanical  carrier  confinement  models  and 
without.  We  used  as  our  prototypical  device 
structure  the  simple  0.18  micron  nMOS  device 
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presented  in  Figure  2.  The  doping  profiles, 
junctions  depths  and  oxide  thickness  were  selected 
to  be  representative  of  devices  of  this  technology 
generation. 

An  examination  of  the  electron  concentration 
near  drain  edge  at  low  drain  bias  (Vj=  0.1  V) 
illustrates  that  the  electron  distribution  obtained 
from  the  HD  3SB  calculation  is  almost  the  same  as 
that  from  a  DD-3SB  calculation  (Fig.  3(a)).  This 
result  can  be  understood  by  the  fact  that  there  is 
almost  no  carrier  heating  for  such  a  low  drain  bias. 
At  high  drain  bias  (Vd=  1.8  V),  on  the  other  hand, 
the  distribution  near  the  drain  edge  obtained  from 
the  HD-  3  SB  calculation  differs  significantly  from 
that  obtained  with  DD  3SB  (Fig.  3(b)).  In  the 
drift-diffusion  approach,  regardless  of  lateral  field 
conditions,  the  carriers  are  assumed  to  be  at  the 
lattice  temperature,  so  the  carriers  are  assumed  to 
occupy  the  lowest  subbands  (i.e.,  quantum  me¬ 
chanical  regime)  more  readily,  and  hence  have  a 
smaller  classical  contribution  (i.e.,  the  unbound 
states).  However,  in  the  hydrodynamic  approach, 
under  high  lateral  electric  field  conditions,  sig¬ 
nificant  carrier  heating  can  occur  near  the  drain 
edge.  Such  energetic  carriers  tend  to  occupy  bound 
states  in  the  higher  subbands  and  the  unbound 
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FIGURE  2  Device  structure  for  prototypical  0.18  micron 
nMOS  transistor.  This  structure  is  used  throughout  the  results 
section  of  the  paper. 
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FIGURE  3  (a)  Electron  concentration  along  the  depth  near 

the  drain  edge  at  the  low  drain  bias  (K^=0.1  V)  for  both  DD 
and  HD  simulations  (classical  and  3-subband)  (b)  Electron 
concentration  along  the  depth  near  the  drain  edge  at  the  high 
drain  bias  (K</=1.8V)  for  both  DD  and  HD  simulations 
(classical  and  3-subband). 

states.  Hence,  for  high  drain  bias,  the  electron 
distribution  near  the  drain  edge  calculated  from 
HD  3-subband  is  more  classical-like. 

The  Id-  Vd  characteristics  from  the  HD  simula¬ 
tions  are  presented  in  Figure  4.  Either  a  tempera¬ 
ture  corrected  van  Dort’s  model  or  the  3-subband 
quantum  mechanical  model  is  used  in  the  HD 
quantum  mechanical  calculations.  The  results 
from  classical  HD  simulation  are  also  presented 
for  comparison.  It  should  be  noted  that  the  drain 
currents  from  the  temperature  corrected  van 
Dort’s  model  agree  very  well  with  those  from  the 
three  subband  HD  simulations.  Also,  the  obser¬ 
vable  difference  in  saturation  currents  between 
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FIGURE  4  Drain  currents  versus  drain  voltage  from  the 
hydrodynamic  simulation,  using  classical,  temperature  cor¬ 
rected  van  Dort’s  model  and  3-subband  model,  respectively. 


classical  and  quantum  mechanical  calculation 
suggests  that  the  carrier  confinement  effects  should 
be  taken  into  account  for  the  deep  submicron 
device  simulation.  The  Id-  Vg  characteristics  from 
HD  simulation  with  classical,  temperature  cor¬ 
rected  van  Dort  model,  and  3-subband  model, 
respectively  (Fig.  5),  indicate  that  the  two  quan¬ 
tum  models  predict  almost  the  same  threshold 
voltage,  while  the  threshold  voltage  from  classical 
HD  calculations  is  smaller  than  that  from  quan¬ 
tum  mechanical  HD  calculations.  The  difference  in 
the  threshold  voltage  between  the  classical  and 
quantum  mechanical  calculations  is  believed  to 


FIGURE  5  Drain  current  versus  gate  voltage  from  the 
hydrodynamic  simulation,  using  classical,  temperature  corrected 
van  Dort’s  model  and  the  3-subband  model,  respectively. 


contribute  to  the  difference  in  the  saturation  drain 
current  between  HD  classical  and  HD  quantum 
mechanical  calculations,  as  shown  in  Figure  4. 

CONCLUSION 

Carrier  confinement  and  quantization  models  have 
been  successfully  implemented  in  the  hydrody¬ 
namic  (HD)  simulation  tool  UT-MiniMOS.  Under 
high  drain  bias  conditions,  the  electron  concentra¬ 
tion  near  the  drain  edge  calculated  from  HD 
simulation  differs  significantly  from  that  of  DD 
simulation  due  to  the  carrier  heating  effect,  both 
with  and  without  quantum  effects  included.  Under 
low  dran  bias  conditions,  quantum  confinement 
effects  are  significant  while  carrier  heating  effects 
are  negligible.  The  Id-  Vd  and  Id-  Vg  character¬ 
istics  suggest  the  need  for  both  the  HD  treatment 
and  the  QM  picture  either  through  the  use  of  the 
3SB  model  or  a  temperature  corrected  van  Dort 
model. 
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Non-local  electron  transport  in  nMOSFET  inversion  layers  has  been  studied  by  Monte 
Carlo  (MC)  simulations.  Inversion  layer  quantization  has  been  explicitly  included  in  the 
calculation  of  density  of  states  and  scattering  rate  for  low-energy  electrons  while  bulk 
band  structure  is  used  to  describe  the  transport  of  more  energetic  electrons.  For 
uniform,  high-lateral  field  conditions,  the  effects  of  quantization  are  less  pronounced 
due  to  the  depopulation  of  electrons  in  the  lower-lying  subbands.  On  the  other  hand, 
Monte  Carlo  results  for  carrier  transport  in  spatially  varying  lateral  fields  (such  as  those 
in  the  inversion  layer  of  MOSFETs)  clearly  indicate  that  depopulation  of  the  low-lying 
subbands  is  less  evident  in  the  non-local  transport  regime.  Quasi-2D  simulations  have 
shown  that,  at  high  transverse  effective  field,  the  inclusion  of  a  quantization  domain 
does  have  an  impact  on  the  calculated  spatial  velocity  transient  . 


Keywords:  «MOS,  inversion  layer,  velocity  overshoot,  quantization,  Monte  Carlo  simulation 


INTRODUCTION 

With  the  continued  scaling  of  the  feature  size  of 
MOS  devices,  carrier  transport  in  the  MOS 
inversion  layers  has  entered  the  regime  where 
non-local  effects  are  no  longer  negligible.  In  a 
previous  study,  velocity  overshoot  was  found  to 
account  for  approximately  20%  of  the  disagree¬ 
ment  in  drain  current  between  measurement  and 
drift-diffusion  simulation  in  a  0.12  pm  SOI 
MOSFET  [1].  Despite  a  few  pioneering  studies 
on  probing  the  electron  spatial  transient  velocity  in 


compound  semiconductor  devices  [2],  a  mature 
experimental  technique  that  allows  the  direct 
measurement  of  carrier  velocity  in  silicon  MOS¬ 
FETs  has  not  been  available.  In  order  to 
accurately  express  these  non-local  effects  in  phy¬ 
sically  based  models,  the  Monte  Carlo  (MC) 
technique  has  become  increasingly  indispensable 
for  device  scientists  and  engineers.  In  conventional 
MC  tools,  effects  due  to  channel  quantization 
resulting  from  the  steep  bulk  band  bending  are 
routinely  ignored.  With  phenomenological  sur¬ 
face-roughness  scattering  models,  some  conven- 
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tional  MC  tools  [3,4]  using  bulk  silicon  band 
structure  have  demonstrated  the  ability  to  repro¬ 
duce  the  experimental  inversion-layer  velocity-field 
characteristics  in  the  local  transport  regime  within  a 
limited  range  of  temperatures.  However,  the  valid¬ 
ity  and  accuracy  of  this  type  of  approach  remains 
questionable  in  the  non-local  transport  regime. 

In  favor  of  the  semi-classical  approach,  one 
might  argue  that  non-local  transport  is  most 
generally  observed  under  conditions  of  high 
electric  field  in  the  transport  direction.  A  condition 
in  which  the  inversion  layer  electrons  populate 
higher  energy  states  and  the  effects  of  quantization 
are  less  pronounced.  However,  the  issue  regarding 
how  fast  the  2DEG  can  respond  to  an  abruptly 
changing  field,  hence  become  more  classical-like, 
has  not  been  investigated.  The  purpose  of  this 
paper  is  to  qualitatively  address  these  issues  with 
simulations  performed  on  quasi-2D  test  structures. 
As  will  be  shown,  inversion-layer  quantization 
does  have  an  impact  on  the  carrier  average  velocity 
in  the  non-local  regime,  despite  the  presence  of 
high  lateral  electric  field. 

THE  MONTE  CARLO  SIMULATOR 

The  MC  program  used  in  this  work  is  an 
integrated  tool  consisting  of  two  simulation 
domains  that  partition  the  entire  inversion-layer 
electron  population  into  quantum  (2D)  and 
classical  (3D)  components.  The  2D  domain  is  used 
to  simulate  transport  of  low-energy  electrons  by 
directly  solving  the  multi-subband  2D  Boltzmann 
transport  equation  [5].  The  3D  domain,  employing 
three  fitted  conduction  bands,  is  used  for  treating 
more  energetic  electrons  that  have  motions  that 
are  more  classical-like  [6],  The  design  of  the 
integrated  tool  allows  the  3D  domain  to  execute 
without  invoking  the  2D  domain.  In  its  stand¬ 
alone  mode,  the  3D  domain  takes  into  account 
phonon  scattering  and  impact  ionization  and  has 
been  calibrated  to  reproduce  uniform  transport 
characteristics  in  bulk  silicon.  It  also  employs  a 
phenomenological  model,  which  was  previously 


validated,  to  take  into  account  surface  roughness 
scattering  [4],  In  the  2D  domain,  the  effective-mass 
approximation  (EMA)  including  bulk  nonparabo- 
licity  is  used  and  effects  of  scattering  due  to  bulk 
phonon  and  surface  roughness  have  been  ac¬ 
counted  for.  Coulomb  scattering  is  greatly  sup¬ 
pressed  due  to  carrier  screening  in  the  strong- 
inversion  regime  and  has  been  ignored  in  the 
present  treatment.  The  envelope  functions  obtained 
within  the  EMA  are  used  to  calculate  the  electron- 
phonon  form  factors  needed  for  computing  the 
phonon  scattering  rate.  The  parameters  used  in 
phonon  and  surface  roughness  scattering  models 
have  been  adjusted  to  reproduce  the  experimental 
mobility  values.  As  shown  in  Figure  1,  the 
inversion  layer  ohmic  mobilities  obtained  with  the 
2D  domain  agree  well  with  those  obtained  in 
previous  experimental  studies  (at  413  K  and  77  K) 
and  the  extensively  validated  UT-mobility  model 
(at  300  K)  [7,8].  It  should  be  emphasized  that  all 
the  MC  mobility  values  are  obtained  with  a  single 
set  of  scattering  model  parameters. 

Physical  boundary  conditions  between  the  2D 
and  3D  domains  are  employed.  The  implementa¬ 
tion  is  similar  to  that  proposed  by  Fischetti  et  al. 
[9]  except  for  the  use  of  a  2D/3D  boundary  (in  the 
energy  space)  more  consistent  with  the  EMA  and 
our  previous  pseudo-potential  calculation  [10]. 
Within  the  EMA,  the  envelope  function  of  a  given 


Effective  Field  (in  V/cm) 

FIGURE  1  low-field  mobility  obtained  with  the  single 
particle  MC  compared  with  experimental  and  UT-mobility 
model  values. 
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subband  is  independent  of  the  parallel  momentum. 
Such  an  insensitivity,  confirmed  in  [10],  indicates 
that  an  electron  with  a  large  parallel  momentum  is 
as  well  quantized  as  that  with  a  small  parallel 
momentum  in  the  same  subband,  despite  the 
difference  between  their  total  energies.  Instead  of 
defining  the  2D/3D  boundary  in  terms  of  total 
energy,  as  suggested  in  [9],  we  use  a  criterion 
based  on  the  electron  total  energy  in  the  quantiza¬ 
tion  (z-)  direction.  The  concept  of  “buffer  sub- 
bands”  suggested  in  [9]  has  been  adopted  to  allow 
2D  electrons  to  enter  the  3D  domain  via  scatter¬ 
ing.  The  six  lowest-lying  subbands  and  64  buffer 
subbands  have  been  used.  Applying  the  above 
criteria,  as  illustrated  in  Figure  2,  a  3D  electron  is 
converted  to  the  quantum  domain  when  the 
following  two  conditions  are  met:  (1)  The  electron 
is  located  in  the  lowest  bulk  valley.  (2)  The  total 
energy  of  the  3D  electron  in  the  z-direction  (Ez)  is 
less  than  E^th\  where  E^th\  is  defined  to  be  the 
edge  of  the  lowest  buffer  subband  and  Ez  is 
evaluated  within  the  parabolic  approximation.  On 
the  other  hand,  a  2D  electron  is  converted  to  the 
classical  domain  when  it  enters  one  of  the  buffer 
subbands  via  intersubband  scattering.  Conserva¬ 
tion  of  the  electron  in-plane  momentum  is  imposed 
in  the  conversion  process.  Since  the  energy 
spectrum  in  the  3D  domain  is  a  continuum,  energy 
conservation  is  enforced  for  the  2D  to  3D 
transition.  However,  energy  cannot  be  conserved 
during  the  3D  to  2D  transition  because  only 
discrete  subband  levels  are  available  in  the  2D 
domain.  In  this  case,  the  subband  that  minimizes 
the  energy  mismatch  is  chosen.  If  the  energy 
conservation  law  were  strictly  enforced  during  the 


FIGURE  2  Illustration  of  the  partition  of  simulation  domain 
in  the  inversion-layer  quantum  well. 


simulation,  the  2D  domain  would  be  depleted  of 
electrons  at  steady  state  under  uniform,  high- 
lateral-field  condition,  which  is  clearly  unphysical. 
To  circumvent  this  problem  without  violating  the 
energy  conservation,  one  needs  to  directly  calcu¬ 
late  the  scattering  rate  between  the  3D  and  the  2D 
states.  This  alternative  approach  is  expected  to  be 
comparable  to  our  current  implementation  since 
the  2D  density  of  states  obtained  with  the  quantum 
treatment  is  expected  to  be  comparable  to  that 
obtained  with  the  classical  treatment  at  sufficiently 
high  energy  (the  correspondence  principle). 

With  the  integrated  MC  tool,  the  velocity- 
versus-field  relations  obtained  at  various  trans¬ 
verse  Eeff  are  observed  to  agree  well  with  the  UT- 
mobility  model  (see  Fig.  3).  As  expected,  the 
calculated  saturation  velocity  is  the  same  as  that  in 
bulk  silicon  since  at  high  field,  the  3D  population 
dominates  and  surface  roughness  scattering  is  less 
effective.  This  is  further  evidenced  in  Figure  4, 
where  the  populations  of  electrons  in  the  quantum 
and  classical  domains  are  separately  plotted  with 
respect  to  the  lateral  field.  At  low  lateral  field,  the 
3D  electrons  have  a  larger  population  for  the  less 
inverted  channel  due  to  the  weaker  quantum 
confinement  at  lower  E& ff.  For  the  same  reason, 
the  3D  component  takes  over  the  2D  component 
at  a  lower  lateral  field  in  the  less  inverted  channel. 


Lateral  field  (V/cm) 


FIGURE  3  Velocity  versus  field  obtained  from  the  MC  tool 
including  both  2D  and  3D  domains  (symbols)  compared  to  that 
obtained  with  the  UT  mobility  model  (lines). 
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FIGURE  4  Populations  of  electrons  in  the  quantum  and 
classical  domains. 


SIMULATION  PROCEDURE  AND  RESULT 

Quasi-2D  structures  with  uniform  MOS  inversion 
layers  subjected  to  artificial  lateral  field  profiles 
that  vary  abruptly  along  the  channel  (x-)  direction 
are  studied.  A  ID  Schrodinger-Poisson  iteration 
following  classical  drift-diffusion  simulation  is 
used  to  obtain  the  self-consistent  subband  disper¬ 
sion  and  scattering  rate  for  given  substrate  doping 
levels  (TVgub)  and  gate  biases  (UG).  Either  a  step  or 
a  ramp  profile  is  chosen  for  the  lateral  field  across 
a  0.9  pm  channel.  The  low-field  region  is 
sufficiently  long  for  transport  to  reach  the  steady 
state  corresponding  to  the  local  field.  A  periodic 
boundary  condition  is  applied  to  both  ends  of  the 
channel  with  carrier  injection  conserving  both 
momentum  and  subband  index.  To  directly  assess 
velocity  overshoot  due  solely  to  the  applied  field, 
Poisson  feedback  resulting  from  the  carrier  redis¬ 
tribution  along  the  channel  has  been  ignored. 

To  appreciate  the  importance  of  the  quantum 
domain  in  the  non-local  transport  regime,  the 
fractional  electron  population  in  the  2D  domain 
under  step-like  and  ramp-shaped  lateral  field 
profiles  is  examined.  The  ramp  field  profile 
corresponds  to  a  2V  voltage  drop  across  the 
0.2  pm  channel  and  roughly  conforms  with 
the  power-supply  scaling  trend  associated  with 
the  scaling  of  device  dimensions.  The  step  profile 


represents  a  worse  case  scenario  where  a  power 
supply  of  4V  is  applied  across  the  channel.  In 
Figure  5,  it  is  observed  that  the  2D  electron 
population  in  the  high  lateral-field  region  remains 
significant  (>40%)  in  the  overshoot  regions 
(around  x  =  0.5pm  for  the  ramp  profile  and 
x  =  0.35  pm  for  the  step  profile),  suggesting  that 
quantization  might  be  of  importance  in  the  non¬ 
local  transport  regime  in  spite  of  the  presence  of 
the  high  lateral  fields.  It  should  be  noted  that,  had 
there  been  no  non-local  effects,  inspection  of 
Figure  4  would  have  lead  to  the  estimated 
fractional  2D  populations  at  these  locations  of 
less  than  20%  even  without  taking  into  account 


x  (gm) 


FIGURE  5  (a)  Fractional  population  of  electrons  in  the  2D 
simulation  domain  (symbols).  Applied  lateral  field  profiles  are 
also  shown  (lines),  (b)  Velocity  profiles  along  the  lateral 
direction  obtained  with  the  lateral  field  profiles  in  (a). 
Overshoot  occurs  near  x  =  0.5|im  and  x  =  0.35  pm  for  the  step 
profile  and  ramped  profile,  respectively. 
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the  fact  that  the  Eefr  used  in  Figure  5  is  lower  than 
those  used  in  Figure  4. 

With  the  same  ramped  lateral  field  profile  shown 
in  Figure  5(a),  Figure  6  shows  that  the  transient 
velocity  obtained  at  different  Jeff’s  and  Nsu^s.  In 
general,  the  transient  velocity  seems  to  have  a 
stronger  dependence  on  Ee^  than  on  Nsu In  the 
low-field  region,  this  is  expected  since  the  quantum 
domain  explains  universal  mobility  quite  well.  The 
universality  of  velocity  (and  mobility)  dependence 
on  Eef[  seems  to  be  retained  along  the  rising  edge 
of  the  transient  velocity  until  the  latter  becomes 
sufficiently  high,  where  Nsub  starts  to  have  some 
impact.  At  higher  £efr  the  lower  velocity  observed 
on  the  rising  edge  can  be  qualitatively  explained  by 
the  lower  mobility  of  the  ground  subband  due  to 
stronger  surface  roughness  scattering.  Near  the 
velocity  peak,  on  the  other  hand,  a  simple 
explanation  for  the  different  velocities  is  not 
available  since  all  subbands  in  the  quantum 
domain  and  the  classical  domain  are  equally 
significant. 

To  see  the  quantization  effects  on  the  MC- 
simulated  velocity  profile,  Figures  7(a)  and  7(b) 
compare  the  velocity  profiles  obtained  with  con¬ 
ventional  MC  with  those  obtained  with  the 
integrated  MC  tool  at  Eefr=  5.6  x  105  V/cm  and 
9.8  xlO5  V/cm,  respectively.  In  both  cases,  lower 


x  (nm) 

FIGURE  6  Velocity  along  the  channel  with  ramped  field 
profile  at  two  different  £eff.  More  pronounced  overshoot  is  seen 
at  the  lower  £eff.  For  each  £efr,  two  different  substrate  doping 
levels  are  used. 


EeH  =5.6x10  5  V/cm 


Eef)  =9.8x10  5  V/cm 


FIGURE  7  Velocity  profiles  obtained  from  conventional  MC 
simulations  compared  to  those  obtained  from  the  integrated 
tool.  Different  EeffS  and  same  substrate  doping  level  (1  x  1016 
cm-3)  are  used  in  (a)  and  (b). 


peak  velocities  are  predicted  by  the  integrated  MC. 
Although  the  difference  between  two  MC  results 
slightly  increases  as  quantization  becomes  stron¬ 
ger,  we  are  reluctant  make  the  intuitively  appeal¬ 
ing  statement  that  the  difference  is  indeed 
enhanced  as  channel  becomes  more  strongly 
inverted.  If  such  a  trend  does  exist,  the  impact  of 
including  channel  quantization  on  the  predicted 
velocity  overshoot  in  conventional  deep-submi¬ 
cron  MOSFETs  might  be  limited  as  the  quantum 
confinement  is  weaker  near  the  drain.  However,  as 
aggressive  device  design  with  asymmetric  channel 
doping  profile  has  been  carried  out  [11]  to  take 
advantage  of  velocity  overshoot  near  the  source, 
where  the  silicon  surface  is  strongly  inverted, 
quantization  effects  on  the  spatial  velocity  transi¬ 
ent  is  likely  to  be  more  significant  in  these  devices. 


CONCLUSION 

Quasi-2D  Monte  Carlo  simulations  that  take  into 
account  the  effects  due  to  size  quantization  have 
been  used  to  study  carrier  velocity  overshoot  in 
nMOS  inversion  layers.  Non-local  effects  are  seen 
not  only  in  carrier  velocity  but  also  in  the  carrier 
population.  With  a  given  ramp-field  profile,  the 
spatial  transient  velocity  is  observed  to  depend  on 
the  transverse  effective  field  more  strongly  than  on 
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the  substrate  doping.  In  this  work,  more  pro¬ 
nounced  velocity  overshoot  is  observed  at  lower 

^eff* 

The  impact  of  quantization  on  simulated  velocity 
profiles  has  been  observed  by  comparing  results 
obtained  with  the  conventional  MC  to  those 
obtained  with  the  integrated  MC  tool.  While 
quantization  seems  to  have  a  slightly  larger  impact 
at  higher  £eff,  more  thorough  study  is  needed  to 
support  such  a  viewpoint,  given  the  uncertainties  in 
models  used  in  both  MC  tools.  Since  the  compar¬ 
ison  is  based  on  two  different  simulation  models, 
each  of  which  contains  certain  number  of  para¬ 
meters  calibrated  such  that  uniform-condition 
transport  characteristics  can  be  reproduced,  we 
cannot  exclude  the  possibility  that  there  exists  a 
different  region  in  the  parameter  space  where  one 
can  draw  a  conclusion  different  from  ours.  In  fact, 
even  among  the  currently  existing  well-calibrated 
classical  MC  codes,  disagreement  in  spatial  tran¬ 
sient  velocity  is  very  likely  to  exist.  The  same 
statement  can  also  apply  to  the  quantum  MC  codes. 
More  studies  are  needed  to  clarify  the  issues  related 
to  the  impact  of  quantization  on  velocity  overshoot. 
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By  using  an  energy  transport  model,  we  simulate  cutoff  frequency  fT  versus  collector 
current  density  Ic  characteristics  of  npn~ n  AlGaAs/GaAs  heterojunction  bipolar 
transistors  (HBTs)  with  various  ^'-collector  thickness  and  ^“-doping  densities.  It  is 
found  that  the  calculated  fT  characteristics  show  double  peak  behavior  when  the  n~- 
layer  is  thick  enough  and  the  ^“-doping  is  high  enough  to  allow  existence  of  neutral  n~ - 
region.  The  mechanism  of  the  double  peak  behavior  is  discussed  by  studying  energy 
band  diagrams,  electron-energy  profiles  and  electron-velocity  profiles.  Particularly,  we 
discuss  the  origin  of  the  second  peak  (at  higher  Ic)  which  is  not  usually  reported 
experimentally. 


Keywords:  GaAs,  heterojunction  bipolar  transistor,  energy  transport  model,  cutoff  frequency, 
double  peaks,  velocity  overshoot 


1.  INTRODUCTION 

Recently,  AlGaAs/GaAs  heterojunction  bipolar 
transistors  (HBTs)  have  received  great  interest  for 
application  to  high-speed  and  high-frequency 
devices.  Since  non  equilibrium  carrier  transport 
becomes  important  in  the  HBTs,  carrier  energy 
should  be  considered  in  the  modeling  of  them.  For 
this  purpose,  the  Monte  Carlo  simulation  [1,2] 
and  so-called  an  energy  transport  model  [3-7] 
have  been  applied  to  analyze  the  characteristics  of 
AlGaAs/GaAs  HBTs. 


Cutoff  frequency  fT  is  one  of  the  figure-of-merits 
of  HBTs’  high-frequency  performance.  Usually, 
the  cutoff  frequency  of  AlGaAs/GaAs  HBTs 
increases  with  the  collector  current  density  Ic 
and  begins  to  decrease  at  a  certain  /c,  showing  a 
single  peak  in  the  experimental  fr—Ic  character¬ 
istics.  However,  according  to  the  simulation  using 
a  drift-diffusion  model,  the  fr-Ic  characteristics 
show  a  steep  second  peak  in  some  cases  [5, 8].  This 
is  attributed  to  the  fact  that  in  the  drift-diffusion 
approximation,  electron  mobility  is  given  as  a 
function  of  local  electric  field  and  the  electron 
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velocity  versus  electric  field  curve  of  GaAs  shows  a 
peak  behavior.  By  the  other  simulation  methods, 
the  second  peak  has  not  been  reported  yet. 

In  this  work,  we  have  systematically  and  care¬ 
fully  analyzed  the  fT  characteristics  of  AlGaAs/ 
GaAs  HBTs  by  using  an  energy  transport  model 
[9]  in  which  electron  mobility  is  determined  by  an 
electron  energy  (not  by  local  electric  field)  and 
velocity  overshoot  can  be  treated.  As  a  result,  we 
have  found  that  the  second  peak  can  arise  (at 
rather  high  current  levels)  also  when  using  this 
model.  Therefore,  we  discuss  here  the  physical 
reason  why  the  double  peak  behavior  in  the  fT 
characteristics  arises. 


2.  PHYSICAL  MODEL 


density  and  carrier  energy.  We  have  simulated  the 
device  characteristics  as  parameters  of  ^“-collector 
thickness  LC\  and  its  doping  density  NCi- 
Next  we  describe  the  derivation  of  electron 
transport  equations.  Assuming  parabolic  energy 
bands,  three  conservation  equations,  *  that  is,  a 
particle  conservation  equation,  a  momentum 
conservation  equation  and  an  energy  conservation 
equation  are  obtained  by  taking  moments  of  the 
Boltzmann  equation  [10],  To  make  these  equations 
tractable  and  applicable  to  the  HBTs,  the  follow¬ 
ing  assumptions  are  made  [5].  Firstly,  we  adopt 
relaxation  time  approximation  for  collision  terms. 
Secondly,  we  adopt  an  equivalent  one-valley 
model.  If  we  assume  that  electron  drift  energy  is 
negligible  small  as  compared  to  its  thermal  energy, 
the  average  electron  energy  wn  can  be  written  as 


2.1.  Device  Structure  and  Basic  Equations 

Figure  1  shows  an  npn~ n  AlGaAs/GaAs  HBT 
structure  analyzed  here.  A1  composition  changes 
from  0.3  to  0  in  the  emitter  and  base  regions,  and 
so  this  is  a  graded  band-gap  base  HBT.  Doping 
densities  are  different  in  respective  regions,  and 
hence  in  general,  transport  parameters  should  be 
given  as  functions  of  A1  composition,  doping 


DISTANCE  ( jjnn) 


FIGURE  1  AlGaAs/GaAs  HBT  structure  simulated  in  this 
study. 


Wn  =  ^  kTn  +  Fy  Alu  (1) 

where  Tn  is  electron  temperature,  Fv  is  upper- 
valley  fraction  and  ALU  is  energy  difference 
between  upper  and  lower  valleys.  To  treat  HBTs, 
some  factors  are  also  considered.  Carrier  recom¬ 
bination  is  treated  by  SRH  statistics,  and  an 
effective  field  acting  on  electrons  which  arises  due 
to  the  position  dependence  of  band  structure  is 
included.  Then  the  electron  transport  equations 
are  simplified  as  follows. 


dh 

'  dz 


=  qU 


(2) 


■Jz{  + 


■4 


JnEn-wn  U-n 


wn  -  w0 


(4) 


where  U  is  the  recombination  rate,  vn  is  the 
average  electron  velocity,  pn  is  the  electron 
mobility,  En  is  the  effective  field  acting  on  electrons, 
rw  is  the  energy  relaxation  time  and  w0  is  the 
equilibrium  value  of  wn.  In  the  electron  transport 
equations,  parameters  that  should  be  given  are 
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the  electron  mobility  the  energy  relaxation 
time  rw  and  the  upper-valley  fraction  Fv. 

For  holes,  we  use  drift-diffusion  type  equations. 
In  addition  to  the  transport  equations,  we  include 
Poisson’s  equation,  which  completes  the  basic 
equations  for  device  simulation. 

2.2.  Transport  Parameters 

Here  we  describe  methods  of  giving  transport 
parameters  such  as  f, tn, ,  rw  and  Fv .  To  do  this, 
usually,  homogeneous  bulk  is  assumed  (d/dz  =  0). 
Then,  the  following  two  equations  are  obtained 
from  the  previous  electron  transport  equations. 

Vfi  —  (5) 

w„  =  w0  -  qrwvnE  (6) 

where  E  is  electric  field.  By  using  the  Monte  Carlo 
method,  v„,  wn  and  Fv  are  obtained  as  a  function 
of  electric  field  E.  Thus,  by  using  Eqs.  (5)  and  (6), 
the  energy-dependent  /x„,  rw  and  Fv  can  be 
obtained. 

In  this  work,  we  must  give  the  transport 
parameters  as  functions  of  A1  composition  x, 
electron  energy  wn  and  doping  density  N,  To  do 
this,  we  first  evaluate  rw  and  Fv  for  x  =  0, 
0.05,  0.1,  0.15,  0.2,  0.25  and  0.3  by  a  Monte  Carlo 
method.  Doping  densities  are  also  varied.  Once 
these  parameters  are  available  as  fit  curves  or 
tables  for  a  given  doping  density,  parameters  for 
any  x  between  0  and  0.3  can  be  obtained  (as  a 
function  of  wn)  by  a  linear  extraporation  method. 
That  is,  if  x  lies  between  Xj  and  x2,  the  parameter / 
is  given  by  the  following  equation. 

/K)  =  /l  +f^-(/2-/i)  (7) 

X2  X\ 

where  f\  and  f2  are  corresponding  values  of  /  for 
x  =  X!  and  x2,  respectively. 

Next  we  show  some  examples  of  transport 
parameters  estimated  by  a  Monte  Carlo  method. 
Figure  2  shows  electron  mobility  energy  relaxa¬ 
tion  time  rw  and  upper  valley  fraction  Fv  as  a 


FIGURE  2  Electron  mobility  \Ln,  energy  relaxation  time  ru 
and  upper- valley  fraction  Fv  versus  electron  energy  u>„  curves  as 
a  parameter  of  x  in  AlxGai_x  As,  calculated  by  a  Monte  Carlo 
method.  The  energy  difference  between  upper  and  lower  valleys 
Alu  and  upper-valley  effective  mass  mu  are  set  to  0.284- 
0.605  x  (eV)  and  0.23  m0,  respectively. 

parameter  of  A1  composition  x.  The  doping  density 
is  2x  1018  cm-3.  Here,  we  consider  L-valley  as  upper 
valley  and  use  L-valley’ s  parameters  for  ALU  and 
upper- valley  effective  mass.  We  also  treat  a  case 
somewhat  considering  X-valley’s  contribution,  but 
x  and  energy  dependences  of  estimated  parameters 
are  essentially  similar  to  those  shown  in  Figure  2  [9]. 
As  described  before,  once  the  figures  like  Figure  2 
are  obtained  for  a  given  doping  density,  the 
transport  parameters  for  any  x  and  for  any  electron 
energy  are  obtained  by  the  linear  extraporation 
method.  Of  course,  when  the  doping  density 
becomes  different,  another  figure  is  required. 
Furthermore,  the  parameters  must  be  evaluated  in 
every  mesh  point.  However,  this  approach  is  simple 
in  itself  and  can  be  easily  implemented. 


3.  SIMULATED  fT-lc  CHARACTERISTICS 

We  calculate  /r-/c  characteristics  of  the  AlGaAs/ 
GaAs  HBT  as  parameters  of  the  n~  collector 
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thickness  LC\  and  its  doping  density  Nc i-  Here  fT 
is  calculated  from  the  following  equation. 

h  =  (1  /2*){dIc/dQ„)Va  (8) 

where  Qn  is  electron  charges  in  the  device  and  VCe 
is  the  collector-emitter  voltage.  VCe  is  set  to  1.5  V 
in  this  study. 

Figure  3  shows  calculated  fT  -  Ic  characteristics 
of  the  HBT  as  a  parameterof  «“-layer  doping 
density  NCu  where  LC1  is  set  to  0.5  pm.  Figure  4 
shows  calculated  fr~Ic  characteristics  of  the  HBT 
as  a  parameter  of  /T-layer  thickness  LC\ ,  where  NC\ 
is  set  to  5  x  1016  cm-3.  As  7C  increases, /r  increases 
because  the  emitter  charging  time  and  the  collector 
charging  time  are  reduced.  From  these  figures,  we 
see  that  for  lower  Nc i  (1016  cm-3  in  Fig.  3)  and  for 
thinner  LC\  (0.1  pm  and  0.2  pm  in  Fig.  4),  fT 
characteristics  show  a  single  peak.  These  cases 
correspond  to  the  situation  that  /z“-collector  layer 
is  almost  or  fully  depleted  already  when  the  base- 
emitter  voltage  (FBe)  is  0  V.  In  the  other  cases, two 
peaks  are  clearly  seen  in  the  fT  characteristics.  In 
these  cases,  neutral  ^“-region  exists  at  Fbe~ 0  V. 

Up  to  the  first  peak,  fT  is  higher  for  higher  NC\, 
as  seen  from  Figure  3.  Also,  the  value  of  Ic  where 
fr  begins  to  decrease  is  higher  for  higher  Nc\ • 
These  are  because  around  the  peak  region,  the 
transit  time  through  ^“-collector  depletion  layer 
(which  is  thinner  for  higher  Nc 0  is  dominant  [5] 
and  fT  begins  to  decrease  due  to  a  high  injection 
effect  which  leads  to  expanding  the  depletion  layer 


FIGURE  3  Calculated  fr~Ic  curves  of  AlGaAs/GaAs  HBTs 
as  a  parameter  of  ^“-collector  doping  density  Nc i-  The  n~ - 
collector  thickness  LC\  is  0.5  pm. 


FIGURE  4  Calculated  /r-/c  curves  of  AIGaAs/GaAs  HBTs 
as  a  parameter  of  «“-collector  thickness  Lq\  .  The  n~ -collector 
doping  density  NC\  is  5  x  10 16  cm-3. 

and  increasing  the  collector  transit  time.  As  seen  in 
Figure  4,  the  fT  characteristics  are  essentially 
similar  between  the  two  cases  with  LC1  =  0.5  pm 
and  0.7  pm.  This  is  because  the  thickness  of  n~- 
collector  depletion  layer  is  determined  by  NC\.  As 
shown  in  Figure  3,  when  NC\  is  higher,  the  value  of 
Ic  where  fT  takes  the  second  peak  is  higher.  We 
will  discuss  below  why  the  fT  characteristics  show 
double  peak  behavior. 

Figure  5  shows  energy  band  diagrams  as  a 
parameter  of  Ic  for  the  HBT  where  NC\  =  5  x  1016 
cm-3  and  LC\  =  0.5  pm.  Figures  6  and  7  show  the 
corresponding  electron-energy  profiles  and  elec¬ 
tron-velocity  profiles,  respectively.  In  these  figures 
Ic=  104  A/cm2,  3xl04  A/cm2,  8xl04  A/cm2,  1.2  x 
105  A/cm2  and  1.5xl05  A/cm2  correspond  to  the 
regions  before  the  first  peak,  around  the  first  peak, 


FIGURE  5  Energy  band  diagrams  of  an  AIGaAs/GaAs  HBT 
with  LC\  =  0.5  pm  and  Ncx  =  5  x  1016  cm-3  as  a  parameter  of 
ICV ce=1.5V. 
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FIGURE  6  Electron-energy  profiles  of  an  AlGaAs/GaAs 
HBT,  corresponding  to  Figure  5.  Lc i=0.5  pm  and  NC\~ 
5xl0,6cm“3. 


FIGURE  7  Electron-energy  profiles  of  an  AlGaAs/GaAs 
HBT,  corresponding  to  Figure  5.  LC\  =  0.5  pm  and  NC\  —  5  x 
1016  cm-3. 


at  the  local  minimum,  at  the  second  peak,  and 
after  the  second  peak  respectively,  in  the  fr~Ic 
characteristics.  From  these,  we  interpret  the 
double  peak  behavior  in  the  following  way. 

As  is  understood  from  Figure  5,  the  first  peak 
arises  because  the  depletion  layer  in  the  n~ - 
collector  layer  expands  due  to  an  high  injection 
effect,  and  hence  the  collector  transit  time  in¬ 
creases.  The  fall  of  Ic  should  last  until  the  n~- 
collector  layer  becomes  entirely  depleted.  Around 
Ic=  8x  104  A/cm2,  the  «“-layer  is  entirely  depleted 
as  seen  from  Figure  5  and  at  this  current,  fT 
characteristics  show  the  local  minimum.  As  can  be 
seen  from  the  band  diagrams,  the  electric  field  at 
the  ^“-collector  layer  near  the  base  becomes 
weaker  when  Ic  increases  further.  Hence,  as  shown 
in  Figure  6,  the  electron  energy  in  this  region 


becomes  lower.  Consequently,  the  electron  mobi¬ 
lity  becomes  high,  leading  to  the  higher  electron 
velocity  than  the  saturation  velocity,  for  example, 
at  7c=1.2xl05  A/cm2  as  shown  in  Figure  7. 
Therefore,  the  collector  transit  time  becomes 
shorter  temporarily  and  hence  fT  begins  to  increase 
again.  In  the  end,  however,  fT  falls  because  of  the 
base-push-out  effect  (Kirk  effect)  which  leads  to  a 
lower  electron  velocity  around  the  base-collector 
interface  and  higher  collector  capacitance  due  to 
the  injected  electrons  whose  densities  become 
higher  than  Nc n  Thus  the  second  peak  arises.  As 
is  evident  from  the  above  discussion,  the  value  of 
Ic  where  fT  shows  a  second  peak  becomes  higher 
for  higher  Nc i. 

As  described  above,  we  have  shown  theoretically 

that  double  peak  behavior  can  be  seen  in  the  fr~Ic 

characteristics  of  AlGaAs/GaAs  HBTs.  Physical 

mechanism  of  this  behavior  has  been  explained. 

♦ 


4.  CONCLUSION 

By  using  an  energy  transport  model,  we  have 
simulated  fT~Ic  characteristics  of  AlGaAs/GaAs 
HBTs  with  various  ^“-collector  thickness  and  n~ - 
doping  densities.  It  is  found  that  the  calculated  fT 
characteristics  show  the  double  peak  feature  when 
the  «“-layer  is  thick  enough  and  the  ^“-doping  is 
high  enough  to  allow  existance  of  the  neutral  n~ - 
region.  It  is  interpreted  that  the  first  peak  arises 
because  the  depletion  region  in  the  «~-layer  begins 
to  expand  due  to  a  high  injection  effect  and  the 
collector  transit  time  increases.  The  fall  of  fT  lasts 
until  the  «“-layer  becomes  entirely  depleted.  When 
the  base  voltage  is  raised  further,  the  electric  field 
in  the  «_-layer  near  the  base  becomes  lower, 
leading  to  the  lower  electron  energy  there.  Then, 
the  electron  velocity  in  the  «~-layer  becomes 
higher,  resulting  in  shorter  collector  transit  time. 
Therefore,  fT  begins  to  increase  again.  Finally,  fT 
decreases  due  to  the  base-push-out  effect  (Kirk 
effect),  resulting  in  the  second  peak.  We  can  say 
that  the  double  peak  behavior  can  be  seen  in  the  fT 
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characteristics  of  real  AlGaAs/GaAs  HBTs  if  the 
collector  current  is  raised  rather  high. 
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We  investigate  the  dependence  of  the  capacitive  energy  associated  with  a  circular 
quantum  dot  on  the  number  of  electrons  in  the  dot  and  on  the  parameters  defining  the 
potential  confinement.  Our  results  reproduce  the  shell-filling  behavior  that  has  recently 
been  experimentally  observed  and  allow  us  to  estimate  the  shape  of  the  confinement 
potential  and  the  dot  size  of  the  experiments. 


Keywords:  Quantum  dot,  shell-filling,  charging  energy,  density  functional 


1.  INTRODUCTION 

Recent  advances  in  nanofabrication  techniques 
have  allowed  a  more  detailed  investigation  of  the 
electronic  structure  of  quantum  dots.  In  particular, 
it  has  become  possible  to  measure,  with  greater 
precision  and  reliability,  the  values  for  the  char¬ 
ging  energies  in  a  quantum  dot.  Tarucha  et  ah  [1] 
have  fabricated  a  circular  quantum  dot  in  which 
vertical  confinement  has  been  achieved  by  means 
of  AlGaAs  barriers  12  nm  apart,  while  lateral 
confinement  has  been  obtained  by  etching  and  by 
means  of  the  depletion  due  to  a  lateral  Schottky 
contact.  Tarucha  et  al.  have  evaluated  the  charging 
energies,  which  they  improperly  name  “addition 
energies”,  by  measuring  the  I-  V  characteristic  for 
several  different  values  of  the  gate  voltage:  in  this 


way  it  is  possible  to  construct  the  “diamond 
diagram”,  typical  of  Coulomb  blockade  studies, 
which  yields  directly  the  values  for  the  charging 
energies. 

For  the  first  time,  the  results  in  Ref.  [1]  show 
atomic-like  shell-filling  effects  in  a  quantum  dot,  as 
predicted  in  Refs.  [2],  [4].  In  order  to  understand 
the  meaning  of  these  results,  we  must  first  define 
rigorously  what  the  charging  energy  is.  We  can 
start  from  the  classical  definition  of  the  capacitive 
energy,  i.e.,  the  variation  in  the  potential  energy  of 
a  capacitor  as  a  consequence  of  the  addition  of 
charge.  For  a  quantum  dot  the  charging  energy 
can  be  defined  as  the  variation  of  the  chemical 
potential  as  a  consequence  of  the  addition  of  one 
electron:  Ec=li(N+  If  the  dot  were  a 

classical  capacitor,  this  would  be  a  constant 
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quantity;  in  the  quantum  case  we  can  define  a 
quantum  capacitance  [2],  depending  on  the  num¬ 
ber  N  of  electrons  contained  in  the  dot,  as 
C(N)  =  e2/Ec,  where  e  is  the  charge  of  the  electron. 

As  long  as  we  add  electrons  to  the  same  orbital, 
the  behavior  of  the  quantum  dot  is  quite  similar  to 
that  of  a  classical  capacitor:  the  charging  energy  is 
substantially  constant  and  depends  mostly  on  the 
geometrical  dimensions.  However,  when  we  start 
populating  a  new  orbital,  the  charging  energy 
exhibits  a  peak,  due  to  the  variation  in  the 
quantum  confinement  energy.  Thus,  the  charging 
energy  is  about  constant,  with  a  peak  every  time 
that  a  new  shell  opens.  In  a  rectangular  dot,  for 
example,  each  orbital  accomodates  only  two 
electrons,  so  that  the  charging  energy  has  an 
oscillating  behavior,  as  a  function  of  the  electron 
number.  A  circular  dot,  instead,  has  several 
degeneracies,  which  lead  to  a  much  smaller 
number  of  peaks,  as  will  be  discussed  in  the 
following. 


2.  MODEL 

We  have  considered  a  2D  model  of  a  quantum  dot 
characterized  by  parabolic  confinement  and  hard 
walls  located  at  a  distance  from  the  center 
corresponding  to  the  geometrical  radius.  For 
purposes  of  easy  comparison  with  the  experimen¬ 
tal  results,  we  assume  the  parabolic  potential  given 
by  V=  1/2  m*  u?r 2,  where  r  is  the  radial  coordinate 
and  m *  is  the  electron  effective  mass,  and  express 
the  slope  of  such  potential  by  means  of  the  product 
tuo.H  being  the  reduced  Planck  constant.  The  2D 
approximation  is  allowed  by  the  fact  that  the 
vertical  dimension  of  the  dot  (12  nm)  is  much 
smaller  than  the  lateral  radius,  therefore  only  the 
lowest  state  along  the  vertical  direction  is  occu¬ 
pied,  and  this  represents  just  a  constant  shift  in  the 
energy,  which  does  not  affect  the  results  for  the 
charging  energy.  The  2D  Schrodinger  equation 
can  be  separated  [3]  into  a  straightforward  angular 
equation  and  into  a  radial  equation,  which  is 
solved  by  means  of  a  standard  numerical  finite- 


difference  procedure.  The  two  equations  read: 


#2 


+  v2<S>((j))  =0 


(1) 


d2P(p)  1  dP(p)  2m * 

dp2  +  p  dp2  ft2 


V(p)P(p) 


+  (k2  --^jp{p)  _0, 


(2) 


where  v2  is  the  constant  for  the  separation  of 
variables  and  k2  =  2m*E/h2,  E  being  the  energy 
eigenvalue.  If  we  enforce  periodic  boundary 
conditions,  the  angular  equation  has  the  simple 
solution  $  =  c  exp(±z</>).  When  i/2  =  0,  there  is  just 
one  solution,  while  for  two  different 

degenerate  solutions  exist,  corresponding  to  the 
up  and  down  orientations  of  the  angular  momen¬ 
tum. 

The  potential  V (p)  is  computed  self-consistently 
at  each  iteration  from  the  electron  density 
obtained  at  the  previous  iteration.  It  includes  also 
a  term  accounting  for  the  exchange  and  correla¬ 
tion  interactions,  in  the  framework  of  a  local 
density  functional  approach  [3].  The  iterative 
procedure  is  stopped  when  the  electron  density 
differs  from  that  of  the  preceding  iteration  less 
than  a  given  amount.  Obtaining  convergence  is 
simple  as  long  as  the  Coulomb  interaction  among 
the  electrons  represents  just  a  perturbation  of  the 
quantum  confinement  energy.  This  condition  is 
satisfied  for  very  small  dots,  since  the  electrostatic 
interaction  energy  scales  with  the  inverse  of  the 
distance,  while  the  quantum  confinement  energy 
scales  with  the  inverse  of  the  square  of  the  dot  size. 
Thus,  for  dots  larger  than  about  lOOnm,  particular 
care  has  to  be  taken  to  ensure  convergence,  with 
the  usage  of  underrelaxation  techniques,  which 
help  controlling  the  instabilities  typical  of  fixed- 
point  iteration  schemes. 

The  chemical  potential  is  obtained  applying 
Slater’s  transition  rule  [3],  which  allows  better 
numerical  precision  than  that  obtainable  from  a 
differentiation  of  the  total  energy. 
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3.  NUMERICAL  RESULTS 

AND  DISCUSSION 

The  reason  why  we  have  chosen  the  particular 
potential  landscape  described  in  the  previous 
section  is  that  it  qualitatively  explains  a  feature 
observed  in  the  experimental  results:  the  charging 
energy  tends  to  saturate  to  a  limiting  value  as  if  the 
size  of  the  dot  were  growing  while  the  number  of 
electrons  is  increased,  until  any  further  growth  is 
limited  by  the  presence  of  hard  walls. 

We  have  first  considered  a  quantum  dot  with  a 
geometrical  radius  of  90  nm  and  we  have  computed 
the  capacitive  energy  for  three  different  slopes  of  the 
parabolic  confinement  potential.  The  results  are 
reported  in  Figure  1,  where  the  charging  energy  is 
plotted  versus  the  number  of  electrons  in  the  dot  for 
hu>=4  meV  (thick  line),  3  meV  (thin  line)  and  2.5 
meV  (dashed  line).  In  all  three  cases  the  capacitive 
energy  tends  to  decrease  and  to  saturate  down  to  a 
limiting  value.  This  is  the  consequence  of  the 
variation  of  the  effective  dot  size  (defined  as  the 
area  over  which  the  electron  density  is  nonnegligi- 
ble)  as  a  function  of  the  number  of  electrons:  when 
there  are  just  a  few  electrons,  they  tend  to 
concentrate  in  the  deepest  region  of  the  potential 
landscape,  near  the  dot  center,  while  for  larger 
numbers  of  electrons  such  minimum  is  effectively 
screened  and  the  charge  is  distributed  over  a  larger 
area,  up  to  the  maximum  size  allowed  by  the  hard 
walls.  The  classical  charging  energy  Ecc  for  a 


FIGURE  1  Charging  energy  for  a  quantum  dot  with  a  radius 
of  90  nm  versus  the  number  of  electrons.  The  bare  confinement 
potential  is  parabolic  with  fiu= 4  meV  (thick  line),  3  meV  (thin 
line),  2.5  meV  (dashed  line). 


conducting  disk  with  a  radius  of  90  nm  is  given  [5] 
by  Ecc = e2/(8Re0£r)  =1-94  meV,  where  R  is  the  disk 
radius,  e0  is  the  vacuum  permittivity  and  er  is  the 
relative  permittivity  of  the  medium.  This  is  in  good 
agreement  with  the  limiting  value  of  approximately 
2  meV  obtained  from  the  simulation.  Another 
major  feature  of  the  computed  charging  energy 
consists  in  the  presence  of  peaks  for  N=  2,  6, 12, 16, 
20,  24,  corresponding  to  those  found  in  the 
experimental  results.  Each  of  them  is  associated 
with  the  filling  of  a  shell  (orbital)  and  its  height 
depends  on  the  difference  between  the  energy 
eigenvalue  of  the  new  shell  and  that  of  the  previous 
one.  If  the  confinement  potential  were  perfectly 
parabolic,  without  any  contribution  from  electron- 
electron  interaction,  there  would  be  peaks  only  for 
N= 2,  6, 12,  20,  30,  because  more  degeneracies  are 
present  with  a  parabolic  potential  than  with  a 
generic  potential  characterized  by  circular  symme¬ 
try.  Circular  symmetry  only  implies  that  when  v=0 
there  is  a  double  degeneracy  (the  spin  degeneracy) 
and  when  v=£0  the  degeneracy  is  fourfold  (spin 
degeneracy  plus  angular  momentum  degeneracy). 
We  do  not  observe  any  peak  for  N=  10,  because  the 
energy  splitting  between  the  two  related  orbitals  is 
extremely  small  (see  Ref.  [6]  for  more  details). 

The  saturation  of  the  charging  energy  towards  a 
limiting  value  determined  by  the  maximum  dot 
size  is  apparent  in  Figure  2,  where  we  report  the 
capacitive  energies  for  %u>  =  3  meV  and  a  dot 
radius  of  75  nm  (thick  line),  90  nm  (thin  line)  and 
120  nm  (dashed  line).  When  the  number  of 
electrons  in  the  dot  is  small,  the  effective  size  is 
independent  of  the  geometrical  radius  and  is 
determined  only  by  the  shape  of  the  parabolic 
potential,  therefore  for  small  N  the  three  curves 
overlap.  For  large  numbers  of  electrons,  instead, 
the  three  curves  tend  to  saturate  to  the  corre¬ 
sponding  classical  capacitive  energies  (the  curve 
for  R=  120  nm  does  not  reach  the  limiting  value 
for  the  maximum  number  of  electrons  included  in 
our  plot). 

The  experimental  results  by  Tarucha  et  al.  show 
a  saturation  value  of  approximately  1.25  meV, 
which  would  correspond  to  a  classical  radius  of 
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FIGURE  2  Charging  energy  for  a  quantum  dot  with  a 
parabolic  confinement  potential  defined  by  Hu  =3  meV  and 
with  a  geometrical  radius  of  75  nm  (thick  line),  90  nm  (thin  line) 
and  120  nm  (dashed  line). 


140  nm.  We  have  therefore  performed  a  calcula¬ 
tion  for  a  dot  with  a  geometrical  radius  of  140  nm, 
choosing  a  confinement  potential  with  Hu=  2.48 
meV,  on  the  basis  of  the  best  possible  agreement 
between  the  results  of  the  simulation  and  the 
experimental  data.  The  computed  capacitive  en¬ 
ergy  as  a  function  of  the  electron  number  is 
reported  in  Figure  3:  the  only  significant  discre¬ 
pancy  with  the  experimental  results  is  represented 
by  the  absence  of  the  peak  for  N=  4,  which  cannot 
be  explained  if  we  assume  rigorous  circular 
symmetry.  It  is  however  possible  that  some 
irregularity  exists  in  the  confinement  potential  of 
the  experimental  dot  and  that  it  produces  measur¬ 
able  effects  only  for  small  numbers  of  electrons, 
while  it  is  screened  out  for  larger  values  of  N. 


FIGURE  3  Charging  energy  for  a  quantum  dot  with  a 
parabolic  confinement  potential  defined  by  Hu  =  2 AS  meV  and 
with  a  geometrical  radius  of  140  nm. 


4.  CONCLUSIONS 

The  charging  energy  for  a  circular  model  quantum 
dot  has  been  evaluated  with  the  inclusion  of 
exchange  and  correlation  effects.  The  numerical 
results  are  in  quantitative  agreement  with  experi¬ 
mental  data  that  have  recently  been  obtained  and 
reproduce  the  position  of  the  observed  peaks, 
corresponding  to  shell-filling  events. 
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In  this  paper,  we  present  insights  into  the  transport  properties  and  the  geometrical 
structure  of  resonant  tunneling  devices  that  can  be  obtained  by  the  study  of  their  noise 
properties.  We  stress  the  importance  of  including  noise  behavior  among  the  objectives 
of  device  simulations.  The  reason  is  twofold:  on  one  hand,  as  the  number  of  carriers 
involved  in  device  operation  decreases,  fluctuations  become  more  relevant;  on  the  other 
hand,  in  devices  whose  functionality  is  based  on  quantum  effects,  noise  properties 
strongly  depend  on  the  details  of  device  geometry. 


Keywords:  Resonant  tunneling,  nanoelectronics,  device  modeling,  noise  modeling,  shot  noise 


1.  INTRODUCTION 

In  recent  years,  noise  characterization  has  emerged 
as  a  powerful  tool  for  obtaining  information  about 
the  structure  and  the  transport  properties  of 
nanoscale  devices  complementary  to  those  given 
by  the  DC  characteristics  and  the  small  signal  AC 
response.  In  fact,  since  the  number  of  charge 
carriers  involved  in  device  operation  is  decreased 
with  respect  to  semiclassical  devices,  it  is  apparent 
that  fluctuations,  and  in  particular  those  due  to  the 
granularity  of  charge  (the  so-called  “shot  noise”), 
acquire  an  increasing  importance.  Furthermore, 
noise  in  such  structures  exhibits  a  behavior  which 
is  strongly  dependent  upon  the  details  of  device 
geometry. 


Therefore,  it  is  important  to  extend  the  domain 
of  device  simulations  to  noise  properties.  Here,  we 
focus  on  resonant  tunneling  structures,  for  which  a 
greater  number  of  experimental  data  is  available. 

Since  the  pioneering  work  of  Lesovik  [1]  and  the 
first  experimental  results  of  Li  and  coworkers  [2], 
many  theoretical  studies  [3-6]  and  experimental 
results  [5,7,8]  have  appeared  in  the  literature, 
assessing  that  the  power  spectral  density  S  of  the 
noise  current  in  such  devices  may  be  suppressed 
down  to  half  the  “full”  shot  noise  value  *Sfun  = 
2 q(i)9  due  to  correlation  in  the  motion  of  in¬ 
dividual  electrons  introduced  by  electrostatic  force 
and/or  Pauli  exclusion. 

In  this  paper,  we  presents  insights  into  the 
transport  properties  and  the  geometrical  structure 
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of  resonant  tunneling  devices  that  can  be  obtained 
by  the  study  of  their  noise  properties.  Our 
calculations  are  based  on  a  model  for  transport 
and  noise  in  generic  resonant  tunneling  structures 
which  has  been  presented  elsewhere  [6, 9],  so  that 
only  a  brief  description  will  be  given  in  Section  2, 
while  the  interested  reader  can  find  analytical 
details  in  the  cited  papers.  Numerical  results  and  a 
comparison  with  available  experimental  results 
will  be  shown  in  Section  3,  while  a  discussion  of 
the  results  obtained  and  of  the  future  develop¬ 
ments  ends  the  paper. 

2.  MODEL 

Let  us  consider  the  one-dimensional  structure 
sketched  in  Figure  1:  it  consists  of  three  regions 
ill,  and  Qn  i.e.,  the  left  reservoir,  the  well 
region  and  the  right  reservoir,  respectively,  that 
are  only  weakly  coupled  through  the  two  tunneling 
barriers  1  and  2.  Moreover,  we  suppose  that 
electron  transport  is  well  described  in  terms  of 
sequential  tunneling:  an  electron  in  traverses 
barrier  1,  loses  phase  coherence  and  relaxes  to  a 
quasi-equilibrium  energy  distribution  in  the  well 


1  2  fir 


FIGURE  1  A  generic  resonant  tunneling  structure  consists  of 
three  isolated  regions  H/,  Slw,  H/  weakly  coupled  by  tunneling 
barriers,  indicated  here  with  1  and  2.  Coupling  between 
different  regions  has  to  be  small  enough  to  be  treated  with 
first  order  perturbation  theory. 


region  SlW9  then  traverses  barrier  2  and  leaves 
through  Qr.  Such  hypothesis  is  very  reasonable, 
except  at  millikelvin  temperatures,  where  inelastic 
processes  are  strongly  suppressed  and  no  more 
effective  in  thermalizing  electrons  in  the  well. 

The  typical  resonant  current  peaks  in  the  I—V 
curve  are  due  to  the  shape  of  the  density  of 
longitudinal  states  in  which  is  strongly  affected 
by  confinement:  for  the  material  parameters 
considered  here,  it  has  a  single  narrow  peak  in 
correspondence  of  the  allowed  longitudinal  energy 
level  of  Qw;  the  rate  of  inelastic  scattering 
processes  affects  the  width  of  such  peak  and  in 
our  model  this  effect  is  taken  into  account  through 
a  phenomenological  parameter,  the  mean  free  path 
/,  which  plays  the  role  of  a  relaxation  length  for 
phase  and  energy.  The  density  of  states  in  the  well 
is  calculated  using  a  compact  formula  derived  in 
Ref.  [11]. 

A  state  in  Qs(s  =  l,r,  w)  is  characterized  by  its 
longitudinal  energy  E ,  its  transverse  wave  vector 
kT  and  its  spin  a.  Tunneling  is  treated  as  a 
transition  between  levels  in  different  regions  [10]  in 
which  E,  kT  and  a  are  conserved. 

Following  Davies  et  al.  [4],  we  introduce  the 
“generation”  rates  gi  and  g2i  i.e.,  the  transition 
rates  from  fi/  to  flyv  and  from  Qr  to  obtained 
by  summing  the  transition  rates  given  by  the  Fermi 
golden  rule  over  all  pairs  of  occupied  states  in  the 
initial  region  and  available  states  in  the  final 
region.  Analogously,  we  define  the  “recombina¬ 
tion”  rates,  i.e.,  the  transition  rates  ru  from 
and  Q/,  and  r2 ,  from  flw  to  f the  total  generation 
rate  g  =  g\+g2 and  the  total  recombination  rate 
r  =  r\+r2-  We  refer  the  reader  to  Ref.  [6]  for 
detailed  analytical  expression. 

The  occupation  factor  in  the  well,  which,  under 
the  assumption  of  complete  relaxation,  depends 
only  on  the  value  of  the  quasi-fermi  level  Efw  in  the 
well,  has  to  be  calculated  in  the  steady  state 
condition,  i.e.,  by  imposing  (g)  =  (r),  where  we 
denote  the  steady  state  value  of  a  generic  quantity 
a  as  (a).  The  average  current  is 

(i)  =  <i(g\  -n)  =  qfa-gi). 


(l) 
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The  problem  of  transport  is  solved  self-consis- 
tently,  since  the  charge  accumulated  in  the  well 
affects  the  conduction  band  profile  of  the  struc¬ 
ture.  We  also  need  to  obtain  the  complete  curve  of 
g  and  r  as  a  function  of  the  number  of  electrons  in 
the  well  N  (that  is  to  say,  of  the  quasi-Fermi  level 
in  the  well).  Then,  we  expand  g  and  r  to  first  order 
in  N,  around  the  steady  state  value  N  and 
introduce  the  following  characteristic  times: 


dgl 

rg\  dN 


1  =  dgi 
Tg2  dN 


1 

Tr  1 


drx 


dN 


N=N 


1  _  dri 
~Wi  =  dN 


N=N 


(2) 

(3) 


from  which  we  can  define 


T1  1  =  Tgl  +  Tr\  -  T2  1  =  Tg2  +  Tr2  (4) 

and 

r_1  =  T\X  +  T2l  •  (5) 

The  power  spectral  density  S(uS)  of  the  noise 
current  at  low  frequency  (wt«  1)  can  be  written 
as 


=  (6) 

V  r2  T1  / 

a  detailed  derivation  of  this  result  can  be  found  in 
Ref.  [6];  it  suffices  here  to  say  that  no  additional 
hypothesis  is  required  to  arrive  at  (6). 

An  important  parameter  is  the  noise  suppres¬ 
sion  factor  7,  also  called  “Fano  factor”,  i.e.,  the 
ratio  between  S  (u)  and  the  “full  shot”  noise  value 
Sfvi\\=z2q(i).  From  (1),  (4),  (5)  and  (6),  it  is  app¬ 
arent  that  it  can  reach  a  minimum  of  7  =  0.5  if 
r\  =  r2,  ( g2 )  <  ( r2 )  and  (rx)  <  (gx). 


3.  RESULTS 

We  consider  a  device  with  the  following  layer 
structure:  a  Si-doped  (Nd=  1.4  x  1018cm-3)  500 
nm-thick  GaAs  buffer  layer,  an  undoped  20  nm- 


thick  GaAs  spacer  layer  to  prevent  silicon  diffu¬ 
sion  into  the  barrier,  an  undoped  11.5  nm-thick 
AlGaAs  barrier  (barrier  2),  an  undoped  5  nm- 
thick  GaAs  quantum  well,  an  undoped  10  nm- 
thick  AlGaAs  barrier  (barrier  1),  an  undoped  10 
nm-thick  GaAs  spacer  layer  and  a  Si-doped  500 
nm-thick  cap  layer.  The  aluminum  mole  fraction 
in  both  barriers  is  0.36.  Noise  measurements  on 
such  structure  as  a  function  of  current  and 
temperature  have  been  presented  in  Ref.  [8]. 

In  Figure  2  the  experimental  forward  I-V 
characteristic  at  77  K  is  compared  with  the 
simulation  result.  The  mean  free  path  is  chosen 
as  a  fitting  parameter,  and  is  equal  to  15  nm.  We 
have  found  that  the  so  called  “peak  to  valley 
ratio”,  i.e.,  the  ratio  between  the  peak  current  and 
the  valley  current,  is  almost  linearly  dependent  on 
the  mean  free  path,  while  the  other  parameters  are 
practically  independent.  For  the  structure  being 
investigated,  a  mean  free  path  close  to  15  nm 
seems  to  be  the  best  fit  at  all  considered 
temperatures  (since  the  mean  free  path  accounts 
for  all  randomizing  effects,  it  seems  that  some 
temperature  independent  cause,  e.g.,  interface 
roughness,  is  predominant);  therefore,  the  decrease 
of  the  peak  to  valley  ratio  with  increasing 
temperature  seems  to  be  due  only  to  the  Fermi 
distribution  spreading. 

The  noise  current  power  spectral  density  5  as  a 
function  of  current  is  plotted  in  Figure  3  for  three 
different  temperatures,  and  for  very  low  bias.  As 
expected,  at  equilibrium  S  tends  to  the  thermal 


FIGURE  2  Forward  I-  V  curves  at  77  K:  comparison 
between  experiment  (solid)  and  simulation  with  mean  free  path 
15  nm  (dotted). 
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FIGURE  3  Noise  current  power  spectral  density  as  a  function 
of  current  density  for  very  low  biases  at  three  different 
temperatures:  14  K  (dotted),  77  K  (solid),  155  K  (dashed).  The 
straight  line  is  S{xl] i  =  2 q(i). 


noise  value  S  =  AGkBT ,  while,  as  the  bias  is 
increased,  it  approaches  the  2 q{i)  curve.  This  is 
an  important  check  for  the  validity  of  our  noise 
model. 

In  Figure  4,  the  noise  suppression  factor  7  as  a 
function  of  current  in  the  resonant  region  of  the 
I-  V  curve  is  plotted  for  different  temperatures 
and  compared  with  the  experimental  results.  As 


FIGURE  4  Noise  suppression  factor  as  a  function  of 
normalized  current  at  the  temperatures  of  14  K  (dotted),  77  K 
(solid),  155  K  (dashed):  comparison  between  experiments  (a) 
and  simulations  (b).  Biases  lower  than  that  of  the  resonant  peak 
are  considered  and  the  current  at  the  resonant  peak  is  taken  as 
unity. 


can  be  seen,  7  increases  with  increasing  tempera¬ 
ture.  This  behavior  is  described  in  [6]. 

For  the  same  structure,  in  Figure  5,  7  is  plotted 
as  a  function  of  the  applied  voltage  at  the 
temperature  of  77  K.  For  bias  voltages  in  the 
negative  differential  resistance  (NDR)  region, 
enhanced  shot  noise  is  to  be  expected,  while  for 
voltages  greater  than  that  corresponding  to  the 
valley  current,  full  shot  noise  is  obtained  (Pauli 
exclusion  and  Coulomb  repulsion  are  no  more 
effective  in  correlating  current  pulses).  For  the 
latter  result,  an  experimental  evidence  can  be 
found  in  [8],  while,  for  the  former,  measurements 
are  in  progress. 

The  enhanced  shot  noise  in  the  NDR  region 
depends  on  the  fact  that  the  characteristic  time  rgl 
is  negative,  i.e.,  the  transition  rate  gx  increase  with 
increasing  N.  The  reason  is  that  the  peak  in  the 
density  of  states  is  below  the  conduction  band 
bottom  of  the  cathode:  when  an  electron  enters  the 
well,  the  conduction  band  bottom  of  the  well  is 
raised,  and  more  states  are  available  for  tunneling 
from  the  left  electrode,  so  that  electron  crossings 
through  the  whole  structure  are  positively  corre¬ 
lated. 


4.  DISCUSSION 

We  have  shown  that  numerical  modeling  of  noise 
in  resonant  tunneling  devices  provides  new  insights 
into  device  physics  and  structure,  allows  us  to 
recover  the  results  of  experiments  with  a  reason- 


FIGURE  5  Theoretical  noise  suppression  factor  at  77  K  as  a 
function  of  voltage. 
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able  accuracy  and  to  predict  interesting  new 
results.  The  results  obtained  give  us  confidence  in 
the  model  for  transport  and  noise  in  resonant 
tunneling  structures  described  in  [6,9, 11]. 

Further  developments  will  include  a  more 
refined  model  to  predict  experimental  data  with 
better  accuracy;  in  particular,  greater  attention  will 
be  devoted  to  the  connection  between  regions  in 
which  transport  can  be  described  as  semiclassical 
and  regions  in  which  the  quantum  nature  of 
transport  must  be  taken  into  full  account. 
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We  develop  a  quantitative  model  of  the  impact-ionizationand  hot-electron -injection 
processes  in  MOS  devices  from  first  principles.  We  begin  by  modeling  hot-electron 
transport  in  the  drain-to-channel  depletion  region  using  the  spatially  varying  Boltzmann 
transport  equation,  and  we  analytically  find  a  self  consistent  distribution  function  in  a 
two  step  process.  From  the  electron  distribution  function,  we  calculate  the  probabilities 
of  impact  ionization  and  hot-electron  injection  as  functions  of  channel  current,  drain 
voltage,  and  floating-gate  voltage.  We  compare  our  analytical  model  results  to 
measurements  in  long-channel  devices.  The  model  simultaneously  fits  both  the  hot- 
electron-injection  and  impact-ionization  data.  These  analytical  results  yield  an  energy- 
dependent  impact-ionization  collision  rate  that  is  consistent  with  numerically  calculated 
collision  rates  reported  in  the  literature. 


Keywords:  Impact  ionization,  hot  electron  injection,  floating  gate  devices,  silicon  electron 
transport,  MOSFET  modeling 


We  develop  a  quantitative  analytical  model  of  the 
impact-ionization  and  hot-electron -injection  pro¬ 
cesses  in  MOS  devices  that  is  derived  consistently 
from  a  single  spatially  varying  hot-electron  dis¬ 
tribution  function.  This  approach  not  only  pro¬ 
vides  a  useful  circuit  model,  but  also  complements 
and  validates  numerical  results  from  Monte  Carlo 
simulations. 

We  measure  hot-electron -injection  (gate)  and 
impact-ionization  (substrate)  currents  using  an  n- 
type  MOSFET  built  with  a  high  substrate  doping 
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(Ixl017cm~3)  operating  with  subthreshold  cur¬ 
rents.  Figure  1  illustrates  the  cross  section  of  the 
device.  In  subthreshold  the  channel  current  of  a 
MOSFET  is  sufficiently  small  so  that  the  mobile 
charge  does  not  affect  the  surrounding  electro¬ 
statics,  resulting  in  a  constant  surface  potential. 
Consequently,  by  operating  the  MOSFET  in 
subthreshold,  we  obtain  a  high  field  region  whose 
properties  are  independent  of  the  channel  current. 
This  higher  substrate  doping  is  consistent  with  a 
0.3  pm  channel  length  CMOS  process;  thus,  these 
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z=0 


FIGURE  I  Cross  section  of  the  MOSFET  device  we  used  to 
measure  the  hot-electron  effects.  It  uses  a  highly  doped  (1  x 
1017cm-3)  substrate  to  achieve  a  high  threshold  voltage  which 
allows  hot-electron  injection  for  bias  current  levels  in  subthres¬ 
hold.  The  n  well  isolates  the  highly  doped  substrate  region  from 
the  surrounding  substrate,  and  allows  measurement  of  substrate 
current.  Holes  resulting  from  impact  ionization  are  measured  at 
the  p  base  contact.  The  hot-electron  injection  process  is 
identical  for  the  FET  with  or  without  the  isolating  n  well. 
Inset:  the  electron  is  accelerated  through  the  drain  depletion 
(path  1),  and  when  it  gains  energy  greater  than  the  Si — Si02 
barrier,  the  electron  is  injected  over  the  Si — Si02  barrier  to  the 
floating-gate  (path  2). 


effects  are  directly  applicable  to  modern  pro¬ 
cesses. 

For  an  electron  to  reach  the  floating  gate,  it 
must  have  energy  greater  than  the  oxide  barrier 
height  and  must  be  directed  towards  the  Si02 
when  the  electron  reaches  that  energy.  The  high 
electric  fields  in  the  drain-to-channel  depletion 
region  accelerate  channel  electrons  to  high  energies 
(path  1).  The  high  substrate  doping  increases  the 
threshold  voltage  6  V)  and  the  drain-to-channel 
electric  field,  which  generates  high-energy  elec¬ 
trons  at  subthreshold  currents  for  positive  gate-to- 
drain  voltages;  therefore,  an  electron  surmounting 
the  Si — Si02  barrier  will  be  transported  to  the 
gate  by  the  resulting  oxide  field  (path  2). 

As  an  electron  gains  energy  due  to  the  electric 
field  in  the  z  direction,  the  electron  is  confined  by 
the  electric  field  and  the  silicon — silicon-dioxide 
interface  in  the  y  direction.  The  resulting  electron 
distribution  in  y  and  ky  is  nearly  independent  of 
the  electron  distribution  in  the  other  coordinates; 
therefore,  some  electrons  at  >>  =  0  are  directed 


toward  the  Si02,  and  these  electrons  will  enter  the 
Si02  if  they  have  gained  sufficient  energy. 


1.  ELECTRON  TRANSPORT  IN 
THE  DRAIN  TO  CHANNEL 
DEPLETION  REGION 


We  begin  by  modeling  hot-electron  transport  in 
the  drain-to-channel  depletion  region  using  the 
spatially  varying  Boltzmann  transport  equation. 
We  can  simplify  the  general  Boltzmann  equation 
to  a  1-D  problem  along  the  channel  (z)  axis  [1]; 
Figure  2  shows  the  conduction  band  as  a  function 
of  position  through  the  MOSFET’s  channel 
region.  Following  a  similar  procedure  to  Baraff 
[2],  we  get 


3T  pSf  ^  p 
te  +  q£dE  +  q£ 


i-C2a/ 
(E  d( 


m*(c) 

(c 


Si/), 


(1) 


where /(z,  c()  is  the  distribution  function,  £(z)  is 
the  component  of  the  electric  field  in  the  z  direction, 
c  is  the  magnitude  of  the  average  momentum 
vector,  (  is  the  cosine  of  the  angle  of  momentum 
vector  and  the  z  axis,  and  S(f)  is  the  collision 
operator.  E—  c2/m*(c)  is  the  electron  energy,  where 
m*(c)  is  the  effective  mass  of  the  electron  that 
depends  upon  the  silicon  band  structure. 

Starting  from  ConwelPs  optical-phonon  colli¬ 
sion  operator  [3],  we  derive  the  following  approx¬ 
imate  optical-phonon  collision  operator  for  E  » 
Er  [1]: 


Sop  (/) 


;  c(E)  \  /. El  d2f  Er  df\ 
m*(E)J\2\dE2  A  dEj ’ 


(2) 


where  ER  is  the  energy  of  an  optical  phonon 
(£^  =  63meV  in  Si).  A  similar  expansion  and 
simplification  has  been  done  for  polar  optical 
phonons  [4].  The  mean  free  length  for  phonon 
collisions  (A)  is  known  to  be  approximately 
constant  for  high  energies.  We  can  remove  the 
bandstructure  effects  in  [1]  by  developing  our 
collision  models  only  in  terms  of  a  mean  free 
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FIGURE  2  Band  diagram  illustrating  hot-electron  injection  in  a  MOSFET  biased  in  subthreshold.  The  appropriate  variables  in  the 
Boltzmann  transport  equation  and  its  variable  transformations  are  shown  on  the  graphs,  (a)  Band  diagram  along  the  surface  of  the 
Si_Si02  barrier.  This  region  is  the  lowest  local  potential  in  either  material;  therefore  the  electrons  are  most  likely  to  travel  along  this 
path.  This  region  corresponds  to  path  1  in  the  inset  in  Figure  1.  (b)  Band  Diagram  of  at  the  drain  edge.  This  region  corresponds  to 
path  2  in  the  inset  in  Figure  1. 


length  and  terms  of  c(E)/m*(E).1  Phonons  have 
momentum,  and  the  total  momentum  involved  for 
a  phonon  absorption  or  emission  must  be  con¬ 
served.  To  precisely  model  this  effect,  one  would 
need  to  know  the  distribution  function  of  momen¬ 
tum  for  the  phonons  in  the  drain-to-channel 
depletion  region.  Elsewhere  we  show  that  the 
scattering  of  the  momentum  distribution  has  a 
small  effect  on  our  zeroth  order  expressions  [1]. 

Most  proposed  impact-ionization  collision  ope¬ 
rators  can  be  formulated  in  general  as 

S(f)  —  _  A _ =  _  _ l _ f  (3) 

UJion‘  rUE)  m*(E)  L(Ey 

where  rion  is  the  mean  free  time  for  an  impact 
ionization  collision,  and  L(E)  is  the  mean  free 
path,  which  is  a  function  of  the  electron  energy. 
We  propose  the  following  model  for  the  energy 
dependence  for  the  impact-ionization  mean-free 
length 


L(E)  =  (0.181A)  exp 


/  /  119eV  \ 

\\l  E  —  0.95  eV  J  ’ 


(4) 


which  is  based  on  our  experimental  measurements 
of  the  impact-ionization  mean  free  length,  and 
corresponds  to  previous  numerical  calculations 
[5-7].  Figure  3  shows  our  functional  form  with 
these  three  numerically  calculated  models.  We 
have  assumed  a  constant  velocity  of  8.1  x  106  cm/s 
in  converting  from  L(E)  to  impact-ionization 
scattering  rate,  since  our  measured  data  is  directly 
related  to  L(E).  This  functional  form  is  a  curve  fit 
to  experimental  data  of  L(E)  derived  from  our 
experimental  measurements  of  hot-electron-injec- 
tion  and  impact-ionization  currents  in  Section  III. 


2.  SOLUTION  OF  THE  TRANSPORT 
EQUATION 


We  analytically  solve  the  resulting  Boltzmann 
transport  equation, 


df  ,  ~C2d/ 

dE  q  (E  d( 


E\  d2f  f 
2\(dE2  L(E)  ’ 


(5) 


Canceling  out  the  effects  of  the  bandstructure  may  limit  the  predictive  power  of  this  model.  This  insight  by  Karl  Hess  is 
appreciated. 
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FIGURE  3  Plot  of  previous  calculations  of  impact-ionization 
rate  versus  electron  energy  in  silicon  and  our  derived  impact- 
ionization  rate  from  our  measured  impact-ionization  and  hot- 
electron  injection  data.  We  have  assumed  a  constant  velocity, 
since  our  model  measures  the  impact-ionization  mean-free 
length.  Our  measured  data  is  directly  related  to  L(E )  and  not 
impact-ionization  scattering  rate. 

for  a  self-consistent  distribution  function  using  a 
two-step  process.  Elsewhere,  we  show  that  the 
transport  along  £=1  for  hot-electron  injection 
and  impact  ionization  closely  approximates  the 
exact  solution  [1]  for  clarity,  we  will  only  consider 
the  C  =  1  case  here.  In  the  first  step,  we  solve  for  the 
average  hot-electron  trajectory  in  energy  and 
direction  as  a  function  of  position  through  the 
depletion  region.  The  average  hot-electron  trajec¬ 
tory  is  the  flow  line  for  the  hyperbolic  P.D.E. 
operator,  and  is  related  to  the  numerical  method 
that  Budd  presented  previously  [9].  In  this  model, 
the  average  electron  starts  gaining  energy  at  the 
position  (zcrit  in  Fig.  2)  where  the  phonon  restoring 
force  is  equal  to  the  energy  increase  due  to  the  local 
electric  field  (£(z)).  This  breakaway  field -the 
minimum  electric  field  at  which  the  electron  gains 
energy  at  the  same  rate  as  it  loses  energy  to  phonon 
collisions  -  is  expressed  as  ER/qX ,  which  for  our 
parameters  is  9.7  V/jim.  The  average  energy,  £j(z), 
that  the  electron  gains  after  reaching  zcrit  is 


or  the  difference  between  the  potential  from  zcrit  to 
the  position  z  in  the  drain-to-channel  depletion 
region,  and  the  number  of  phonon  collisions  in 
this  region.We  show  the  electron  in  Figure  2 
taking  a  linear  path  because  of  the  functional  form 
of  Eq.  (6). 

In  the  second  step,  we  solve  for  the  electron 
distribution  function  around  this  average  electron 
trajectory.  In  this  coordinate  system,  phonon 
collisions  diffuse  the  electron  distribution  spatially, 
and  impact-ionization  collisions  remove  high- 
energy  electrons.  To  simplify  the  analysis,  we 
assume  that  the  electron  leaving  at  zcrit  dominates 
the  behavior  of  hot-electron  injection  and  impact- 
ionization  for  a  wide  range  of  drain  voltages;  the 
limitations  of  this  approximation  are  illustrated  in 
Figure  4.  Using  a  more  complicated  initial  and 
boundary  conditions,  f(z,E)  nearly  follows  an 
effective  temperature  solution  for  high  electron 
energies,  and  f(z,E)  is  the  convolution  of  several 
Gaussians  at  low  energies.  From  this  analysis,  the 
solution  for  the  distribution  function ,f(z,E\  is 


/(z,£)  =  exp 


A 


Z  ZCT[  t 


»)■) 


a(z,E), 

(7) 


where  a(z,  E)  models  the  electrons  lost  to  impact 
ionization,  and  is  approximated  by 


(8) 

This  solution  shows  that  the  assumption  of  a 
constant  electron  temperature  is  not  valid  at 
energies  at  which  impact  ionization  and  hot- 
electron  injection  occur. 


3.  COMPARING  THEORY  WITH 
EXPERIMENT 


Ex{z)  =  qV(z)  -  qV(zcrit)  -  ERZ-^*,  (6) 


From  the  electron  distribution  function  in  (7),  we 
can  calculate  the  probabilities  of  impact  ionization 
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FIGURE  4  Picture  of  the  distribution  function  for  an  electron  in  the  drain-to-channel  beyond  z  =  zcrit*  This  figure  compares  our 
approximate  model  to  the  solution  using  the  exact  conditions  around  zcrit.  For  energies  at  or  below  the  average  electron  energy,  the 
distribution  function  shows  the  cumulative  effect  of  electrons  leaving  the  conduction  band  after  zcrit.  For  large  positive  energies,  the 
distribution  function  does  not  change  as  fast  as  the  Gaussian,  but  rather  at  a  slope  due  to  an  effective  temperature. 


and  hot-electron  injection  as  functions  of  channel 
current  and  drain-to-channel  voltage  ($dc).  We 
use  two  free  parameters,  A  and  Box  >  as  well  as  our 
functional  form  for  L(E).  The  hot-electron - 
injection  efficiency -the  ratio  of  the  injection 
current  (7inj)  and  the  source  current  (/,) -is 
approximately  given  by 


^  =  52exp 

's 


A  (Eos-EijdyY1  7.102  \ 

d  “  Zcrit  \  2Er  /  E\  (<7)  J 

(9) 


where  A  is  equal  to  6.5  nm,  Eox  «  2.8  eV  is  the 
Si — Si02  barrier  height  at  the  drain,  7?2  =  4.55  x 
10-3,  d($ dc)  is  the  width  of  the  drain-to-channel 
depletion  region,  y/E\(d)  —  \fWJc  —  and 

3>crit  is  the  potential  drop  from  z  =  0  to  z  =  zcr it.  Our 
experimental  data  on  the  Early  voltage  versus  4>dc 
show  that  the  channel  doping  profile  is  approxi¬ 
mately  a  step  junction  for  a  fixed  gate  voltage  [1], 
Figure  5  shows  measured  data  of  hot-electron - 
injection  efficiencies  as  a  function  of  drain-to- 
channel  voltage  for  two  channel  currents;  the  hot- 
electron -injection  efficiency  is  independent  of 
source  current.  Figure  5  shows  (9)  fitted  to  the 


injection  efficiency  data.  The  curve  fit  shows  close 
agreement  to  (9)  except  at  large  $dc  (>  5.0  V),  due 
to  average-electron  energy  being  near  the  energy  of 
the  silicon — silicon-dioxide  barrier,  and  at  small 
5>do  probably  due  to  the  simplified  modeling  of  the 
band-structure  effects  in  the  collision  operators. 

The  impact-ionization  efficiency -the  ratio  of 
the  substrate  current  (7sub)  and  the  source  current 

(4)-  is 


Jo 


We  get  an  approximate  solution  by  substituting 
(4),  (8)  and  expanding  the  function  in  the  exponent 
around  the  function’s  maximum  value  in  E.  We 
show  the  general  solution  elsewhere  [1];  the 
solution  for  substrate  doping  of  Na=  1017cm~3  is 


(H) 
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Drain-to-channel  potential  (V) 

FIGURE  5  Measurements  of  hot-electron -injection  efficiency  verses  drain-to-channel  voltage  for  two  values  of  source  current. 
The  drain-to-channel  voltage  is  computed  from  the  source  current  and  the  drain-to-source  voltage.  For  each  sweep,  we  used  a 
constant  gate  voltage  to  chose  a  particular  channel  current;  the  actual  oxide  barrier  height  changes  slightly  due  to  image  force 
lowering,  because  the  floating-gate-to-drain  voltage  is  not  constant. 


Figure  6  shows  experimental  measurements  of  a 
versus  drain-to-channel  potential.The  solid  line  is 
the  curve  fit  of  (1 1)  to  the  experimental  data;  the  fit 
closely  agrees  with  the  measured  data.  From 
measured  values  of  a  versus  $dc,  our  analytical 
model  allows  us  to  measure  the  energy-dependent 


FIGURE  6  Measurements  of  impact-ionization  efficiency  vs. 
drain  to  channel  voltage  for  two  source  currents  (gate  voltages). 
We  plot  a  curve  fit  to  the  analytic  model  in  (11);  the  model 
closely  agrees  with  the  experimental  data. 


impact-ionization  collision  rate  from  experimental 
data;  (4)  is  a  curve  fit  to  these  data. 
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Inclusion  of  Bandstructure  and  Many-Body  Effects 
in  a  Quantum  Well  Laser  Simulator 

F.  OYAFUSO*,  P.  von  ALLMEN,  M.  GRUPEN  and  K.  HESS 
Beckman  Institute,  University  of  Illinois,  Urbana,  IL  61801 


A  self-consistent  eight  band  k-p  calculation,  which  takes  into  account  strain  and 
includes  Hartree,  exchange,  and  correlation  terms  (determined  from  a  local  density 
approximation)  is  incorporated  into  a  QW  laser  simulator  (MINTLASE-II).  The 
computation  is  performed  within  the  envelope  function  approximation  for  a  super¬ 
lattice,  in  which  all  spatially  varying  terms  of  the  k-p  Hamiltonian,  including  the 
exchange  and  correlation  energies  are  expanded  in  plane  waves.  The  k-p  eigenvalue 
equation,  and  Poisson’s  equation  are  solved  iteratively  until  self-consistency  is  attained. 
Results  from  the  k-p  calculation  are  exported  to  MINILASE-II  via  a  density  of  states 
and  an  energy  dependent  optical  matrix  element  factor,  renormalized  by  a  Coulomb 
enhancement  factor  to  account  for  electron-hole  attraction.  Results  are  presented  for 
the  gain  spectrum  and  modulation  response  for  a  Ga0.8lno.2  As/Al0.i  Ga0.9  As  quantum 
well  laser  with  and  without  the  inclusion  of  the  Coulomb  enhancement  factor. 


Keywords:  Gain,  k-p,  laser,  quantum  well,  modulation  response,  bandstructure 


MINILASE-II  [1]  is  a  semiconductor  laser  simu¬ 
lator  that  self-consistently  solves  Poisson’s  equa¬ 
tion,  the  carrier  transport  equations  (both  drift 
diffusion  and  thermionic  emission  across  hetero¬ 
structures),  quantum  well  (QW)  capture,  and  the 
photon  rate  equations  for  arbitrary  two  dimen¬ 
sional  geometries.  Many  important  effects  arise 
from  the  bandstructure  near  the  quantum  well. 
However,  it  is  computationally  impractical  to 
perform  an  accurate  self-consistent  bandstructure 
calculation  within  the  laser  simulator  at  each 
iteration.  The  purpose  of  this  paper  is  to  explain 
how  we  connect  a  separate  eight  band  k-p 
superlattice  calculation,  including  many-body 
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effects,  with  MINILASE-II  and  to  present  results 
for  the  gain  and  modulation  response  for  a 
strained-layer  Ino.2Gao.8As/Alo.1Gao.9As  quantum 
well  laser. 

The  k-p  calculation  involves  diagonalizing  a 
Hamiltonian  which  at  the  center  of  the  Brillouin 
zone  can  be  expressed  as 

H%\khkz)  =  (f„(0)  +  fa 

+  (k||  +  kzez)  ■  P„(  (1) 

+  Ysrnek«kP  +  H{slmm) 

a{3 
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where  the  indices  run  over  the  bands  in  our  basis 
set  B ,  are  the  momentum  matrix  elements,  and 
rjjf  are  renormalization  constants  describing  the 
contribution  from  bands  not  contained  in  B.  To 
characterize  the  bandstructure  of  crystals  with 
zincblende  symmetry  near  T,  it  is  sufficient  to 
include  the  heavy  hole  (HH)  and  light  hole  bands 
(LH)  (Tg),  the  split-off  bands  (SO)  (T$)  and  the 
lowest  conduction  bands  (T£).  /f (stram)?  a  k- 
independent  term,  describes  the  strain,  which  is 
assumed  to  be  confined  to  the  well  region.  The 
resulting  8x8  Hamiltonian  has  been  described  in 
the  literature  [2,  3]. 

For  the  superlattice  calculation,  z  is  taken  to  be 
the  growth  direction  and  the  usual  substitution 
kz-^dji  is  made  to  obtain  the  effective  mass  Eq.  [4], 

Y^Hme(khd2/i)<l>f^\z)  =  E^4^\z),  (2) 


where  kq  is  the  wave  number  for  the  ID  Brillouin 
zone  of  the  superlattice,  and  <j>ffikq\z),  the  super¬ 
lattice  envelope  functions,  are  related  to  the  wave 
functions  to  second  order  by 


V'nki  =  exp(/k||  •  r)  ^2  (  um0  +  ^ 

meB  ' 


k||  •  P bn 


(3) 


Et{ 0)  “  Em{ 0) 


determined  from  [6]  except  for  the  conduction 
band  and  valence  band  deformation  potentials 
used  to  determine  band  edge  shifts  due  to  strain 
and  the  fraction  conduction  band  discontinuity, 
which  are  not  well  known.  We  assume  a  fraction 
conduction  band  discontinuity  of  0.7  and  1 1  eV  for 
the  difference  in  conduction  band  and  valence 
band  deformation  potentials. 

To  account  for  carrier -carrier  interactions 
additional  terms  which  depend  on  electron  and 
hole  densities  are  added  as  diagonal  terms  in  the 
k  p  Hamiltonian.  The  direct  Coulomb  interaction 
gives  the  Hartree  potential  VH  which  is  determined 
by  solving  the  ID  Poisson  equation: 

dz(£d2V„)  =  -~p3D(z)  (4) 

Pid{z)=?j-  f  m,z)\2f(Ej{k),n(n2D)) 

^  jbz{2'k)  j 

(5) 

where  the  envelope  functions  have  been  normal¬ 
ized  to  the  superlattice  period  L.  The  Fermi 
distribution  depends  on  the  energy  disper¬ 

sions  found  from  the  kp  calculation  and  on  the 
quasi-chemical  potential  fi,  determined  from  the 
input  parameter  n2n,  the  carrier  density  per  unit 
surface  in  one  period  of  the  superlattice. 

«2*  =  W  -^-2f(Ej(k),v)  (6) 

j  Jbz  (27 r) 


The  band  parameters  which  enter  the  Hamiltonian 
are  allowed  to  have  different  values  in  the  well  and 
the  barrier  regions  [5],  and  the  resulting  operator  is 
Fourier  transformed.  For  a  superlattice  period  of 
500  A,  typically  about  thirty  Fourier  components 
are  required  to  ensure  convergence  of  the  ground 
state  eigenenergies  to  within  1  peV.  The  figures 
throughout  this  paper  were  generated  for  a  super¬ 
lattice  consisting  of  Al01Ga0.9As  barriers  of  400  A, 
and  In0.2Ga0.gAs  wells  of  80  A.  The  barriers  are 
wide  enough  to  decouple  adjacent  wells  so  that  the 
superlattice  is  in  practice  a  collection  of  indepen¬ 
dent  quantum  wells.  Most  band  parameters  were 


The  interaction  of  a  carrier  with  its  exchange- 
correlation  hole  lowers  its  energy  and  results  in  a 
narrowing  of  the  bandgap.  This  effect  is  taken  into 
account  in  the  local  density  approximation.  We 
use  an  interpolated  expression  derived  by  Hedin 
and  Lundquist  [7]  for  three  dimensional  systems, 
and  treat  electrons  and  holes  as  separate  plasmas. 
The  bandgap  renormalization  (BGR)  obtained  is 
in  qualitative  agreement  with  experiment  and  can 
be  improved  by  using  an  expression  more  appro¬ 
priate  for  2D  systems. 

Because  these  additional  terms  depend  on  the 
eigenfunctions  determined  from  the  k  p  calcula- 
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tion,  an  iterative  approach  must  be  used.  First  the 
k  p  Hamiltonian  is  diagonalized  for  a  suitable 
initial  guess  for  the  Hartree,  exchange  and 
correlation  energies.  For  a  given  2D  carrier 
density,  the  chemical  potentials  are  then  computed 
assuming  a  Fermi  distribution  function.  The  z 
dependence  of  the  3D  carrier  density  can  then  be 
determined  and  used  to  compute  a  new  estimate  of 
the  Hartree  and  exchange  and  correlation  poten¬ 
tials.  To  assure  convergence,  an  underrelaxation 
scheme  is  used  for  the  exchange  and  correlation 
potentials.  The  approach  is  illustrated  in  the  flow 
chart  in  Figure  1 . 

Once  convergence  has  been  attained,  optical 
matrix  elements  (OME)  given  by  Mnnf  —  (tl>nk\\\ 
p|VVfc||)  are  computed  for  later  use  in  MINILASE 
II.  The  square  of  the  OME  gives  the  strength  of 
the  electron-hole  coupling  through  the  photon 
interaction  and  therefore  enters  the  expressions  for 
spontaneous  and  stimulated  emission.  Figure  2a 
shows  the  OME  dispersion  for  TE  polarization 
and  two  directions  of  ky,  parallel  to  and  perpendi¬ 
cular  to  e.  At  T,  CB1  couples  only  to  HH1  and 


Solve  the  k-p  eigenvalue  problem: 
(Hm>+  Vxc  +  V„  -  E)VF  =  O 

Compute  the  quasi-chemical  potentials: 


Solve  for  wave-vector  independent  dens! ties: 

|  P~ 

........ _ _  1 

:  Solve  Poisson’s  equation  for  the  electrostatic  potential; 

I  V.(eVV„)  « 


£ 

c 


Recompute  the  exchange-correlation  potentials 
within  LDA. 


Mix  new  and  old  V  (Underrelaxation) 


FIGURE  1  Flowchart  for  self-consistent  k  p  calculation. 


LH1  because  of  the  orthogonality  relation  between 
the  envelope  functions.  However,  away  from  T, 
band  mixing  allows  formerly  forbidden  transitions 
to  take  place.  As  expected  for  this  polarization,  the 
coupling  at  T  between  *CB1  and  HH1  is  about 
three  times  stronger  than  between  CB1  and  LH1 
[8]. 

Due  to  the  Coulomb  attraction  between  elec¬ 
trons  and  holes,  spontaneous  and  stimulated 
emission  are  enhanced.  This  Coulomb  enhance¬ 
ment,  which  may  be  thought  of  a  renormalization 
of  the  optical  matrix  elements,  is  computed 
external  to  MINILASE  II  for  various  carrier 
densities  and  lasing  energies.  The  expression  used 
for  the  Coulomb  enhancement  factor  has  been 


Ikl  /  (2rc/a) 


Transition  Energy  (eV) 

FIGURE  2  (a)  Optical  matrix  elements  for  ky  parallel  to 
(solid)  and  normal  to  (dashed)  the  polarization.  Transitions 
between  the  lowest  conduction  subband  CB1  and  the  valence 
subbands  HH1,  HH2  and  LH1  are  shown,  (b)  Density 
dependence  of  the  Coulomb  enhancement  factor. 
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derived  in  the  literature  [9]  and  is  given  by 

_ 1__ 

1  -  q(k) 

where 

(7) 

my)  uy)+M^Lil 

Ti  i(Ec(k’)-Ev(k')-v)  +  \/T 

Here  /„  and  fp  are  Fermi  occupation  factors,  t  is 
the  carrier-carrier  scattering  time,  assumed  to  be 
100  fs,  and  V(s)  is  the  2D  screened  Coulomb 
potential  which  in  the  static  long  wavelength  limit 
of  RPA  is  given  by 


where 


V(s)(q) 


lire1  1 
e0q  1  +  Kjq 


lire2  /  dn  ^  dp  \ 
e0  \dtin  dfJLp) 

is  the  inverse  screening  length.  Figure  2b  shows  the 
renormalization  factor  for  the  CB1-HH1  transi¬ 
tion  between  the  lowest  conduction  subband  and 
lowest  valence  subband  for  several  carrier  densities 
above  threshold.  In  this  calculation,  an  axial 
average  of  the  matrix  elements  is  performed,  so 
that  the  OME  may  be  treated  strictly  as  a  function 
of  energy  rather  than  in-plane  wave  vector  ky.  We 
see  that  the  modest  enhancement  of  the  OME 
decreases  as  the  carrier  density  increases.  This 
result  stems  from  the  fact  that  screening  is 
enhanced  at  higher  densities. 

The  results  of  these  calculations  are  exported  to 
MINILASE II  in  three  forms:  the  spatial  profile  of 
the  envelope  functions,  a  density  of  states  and  an 
optical  matrix  element  factor  P„(0)  (associated 
with  each  conduction  band  energy  grid  point  n) 
given  by 


Pn{V)  =  £  f  ^||e  •  My(k,)|2Tfc(£*(k,)) 

ij  J 

«(^(kj|)-^v(k||)-fi)  (8) 

The  direction  of  the  light  polarization  is  e,  and  r]n 
is  a  hat  function  with  support  on  the  energy  range 
defined  by  grid  points  n—  1  and  n+l  and  whose 
area  is  normalized  to  unity.  The  sum  extends  over 
all  conduction  subbands  i  and  valence  subbands  j. 
The  gain  coefficient  is  then  given  by 

s(0)(fi)  ~  +M£n  -  n)  - 1) 

n  (9) 

pH{n)C(n,T) 

where  C  is  a  Lorentzian  broadening  function  and 
fe  and  fh  are  the  non-equilibrium  electron  and  hole 
distribution  functions,  respectively.  Minilase  II 
incorporates  the  k  p  data  in  an  iterative  manner. 
That  is,  the  simulator  first  solves  the  carrier 
transport  equations,  Poisson’s  equation  and  the 
photon  rate  equations  through  Newton’s  method. 
The  resulting  carrier  densities  are  used  to  recom¬ 
pute  the  band  edges  from  an  interpolated  expres¬ 
sion  for  the  bandgap  renormalization.  Then,  files 
appropriate  for  the  computed  density  are  read 
from  disk.  Since  the  exchange-correlation  terms 
are  added  as  k-independent  diagonal  elements, 
their  inclusion  does  not  significantly  alter  the 
effective  masses  and  therefore  the  densities  of 
state,  except  for  shifting  the  band-edges. 

We  now  briefly  describe  a  few  results  obtained 
from  MINILASE-II  for  an  operating  regime 
beyond  the  lasing  threshold.  Figure  3(a)  shows 
the  optical  gain  spectra  with  and  without  the 
Coulomb  enhancement  factor  for  the  same  applied 
current.  In  steady  state,  the  maximum  height  of 
the  gain  is  pinned  by  the  losses  in  the  laser  which  is 
the  same  in  each  case.  There  are  two  salient 
features  that  distinguish  the  gain  curve  without  the 
Coulomb  enhancement  from  the  other  curve. 
These  are  the  reduced  threshold  for  the  onset  of 
gain  and  the  larger  transparency  point.  Both 
features  stem  from  the  smaller  effective  matrix 
element  in  the  absence  of  Coulomb  enhancement 
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FIGURE  3  Gain  with  and  without  Coulomb  enhancement. 

that  requires  greater  pumping  of  the  laser  to 
achieve  a  fixed  gain.  Thus,  more  carriers  are 
required  which  results  in  a  reduction  in  the  band 
gap  and  implies  a  greater  separation  of  the 
electron  and  hole  quasi-fermilevels  and  thus  a 
larger  transparency  point. 

Figure  3(b)  displays  the  laser’s  response  to  a 
small  square  pulse  in  the  applied  voltage.  We  note 
that  the  frequency  peak  is  blue  shifted  when  a 
larger  effective  matrix  element  is  used.  This  may  be 
explained  as  follows.  The  photon  rate  equation  has 
the  form 

Su  =  (Gv-1/tt)S„  +  R 

where  v  is  a  mode  index,  Sv  is  the  photon 
population,  Gv  is  the  modal  gain,  r7  is  the  photon 
lifetime  and  R  is  the  spontaneous  emission  rate, 
which  is  much  smaller  than  the  other  terms  of  the 
right  hand  side  under  lasing  conditions.  Since  the 


losses  1/t  are  fixed,  the  gain  is  roughly  propor¬ 
tional  to  the  time  derivative  of  the  photon  density 
in  the  cavity  which  implies  a  more  swiftly  res¬ 
ponding  laser. 

In  conclusion,  we  described  how  an  accurate  k  p 
calculation  was  connected  to  a  sophisticated  laser 
simulator  and  demonstrated  how  the  larger 
effective  matrix  element  that  stems  from  the 
Coulomb  enhancement  affects  the  gain  and 
resonance  frequency  in  the  modulation  response 
of  a  laser. 
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Optical  and  Electronic  Properties  of  Semiconductor 
2D  Nanosystems:  Self-consistent 
Tight-binding  Calculations 
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A  tight-binding  models  which  account  for  band  mixing,  strain  and  external  applied 
potentials  in  a  self-consistent  fashion  has  been  developed.  This  allows  us  to  describe 
electronic  and  optical  properties  of  nanostructured  devices  beyond  the  usual  envelope 
function  approximation.  This  model  can  be  applied  to  direct  and  indirect  gap 
semiconductors  thus  allowing  for  instance  the  self-consistent  calculation  of  band  profile 
and  carrier  control  in  pseudomorphic  InGaAs/GaAs  HEMTs  and  SiGe/Si  MODFETs. 


Keywords:  Semiconductors,  nanostructures,  electronic  band  structure,  tight-binding,  optical 
properties 


1.  INTRODUCTION 

Electronic  and  optical  properties  of  semiconductor 
nanostructures  based  on  homo-  and  heterojunc¬ 
tions  have  been  investigated  theoretically  by 
means  of  a  variety  of  tools.  These  range  from  ab- 
initio  approaches  [1],  which  are  very  precise  but 
require  a  large  computational  effort  and  conse¬ 
quently,  are  limited  only  to  very  small  nanostruc¬ 
tures,  to  approximate  but  easy-to-handle  and  fast 
methods  such  as  for  example  those  based  on  the 
envelope  function  approximation  (EFA)  [2].  In  its 
simplest  form,  the  EFA  leads  to  the  evaluation  of 
the  energy  levels  of  nanosystems  by  simply  solving 
a  one-electron  Schrodinger  equation  where  each 
semiconductor  is  described  in  terms  of  effective 
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masses  and  band  edges.  Such  an  approach  and  its 
generalizations,  have  been  very  successful  [2-4], 
even  though  several  problems  still  cannot  be  easily 
handled  within  the  EFA  context.  The  empirical 
tight  binding  method  (TB)  [5-8]  has  been  shown 
to  be  a  valid  alternative  to  EFA,  since  it  improves 
the  physical  content  in  the  description  of  the 
nanostructure  with  respect  to  EFA  without 
requiring  a  much  higher  computational  effort.  In 
particular,  it  allows  us  to  treat  indirect-gap 
semiconductors,  heterostructures  made  by  indir¬ 
ect/direct  materials  and  to  describe  very  thin 
nanostructures  [5-7,  9].  Other  advantages  of  TB 
with  respect  to  EFA  consist  in  the  realistic 
description  of  the  band  structure  of  the  whole 
Brillouin  zone  and  in  the  possibility  to  use 
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different  Bloch  basis  functions  for  each  component 
material  of  an  heterosystem.  TB  has  been  mainly 
used  in  the  calculation  of  the  electronic  properties 
of  nanostructures  without  taking  into  account  self- 
consistent  charge  redistribution,  which  is  an 
important  requirement  when  we  deal  with  real 
systems.  However,  very  recently,  we  have  shown 
[10]  that  a  self  consistent  tight-binding  procedure 
can  be  defined.  In  this  paper  we  will  apply  the  self- 
consistent  tight-binding  approach  to  realistic 
nanostructures  such  as  pseudomorphic  HEMT 
and  we  will  show  how  the  method  can  be  easily 
extended  to  handle  the  case  of  indirect  band  gap 
material  such  as  in  SiGe  MODFETs. 

2.  SELF-CONSISTENT  TIGHT  BINDING 

In  this  section  we  discuss  the  self-consistent  tight- 
binding  model  for  a  system  where  symmetry  is 
broken  in  one  direction,  for  example  the  growth 
axis  (z).  The  wave  function  \E ,  Ay)  can  be  written 
as  linear  combination  of  planar  Bloch  sums,  \a,  m) 


[11] 

l-£**n>  =  'ECa’m(kbE)\a>m) 

a) 

with 

a,  m 

W,m)  =  -mZe“r*\atRa)  , 

VN 

(2) 

ii 


where  |a/?a)  is  a  localized  orbital,  Ay  is  the  in-plane 
wave  vector  and  N  is  the  number  of  unit  cells  in 
the  atomic  plane.  The  subindex  a  refers  both  to 
the  basis  atom  index  and  to  the  atomic  orbital 
index.  The  lattice  vector,  Ra  =  R+  va,  (where  va  is 
the  basis  atom  displacement),  can  be  written  as 
Ra  =  md+R™ ||  where  m  is  an  integer,  d  a  vector 
parallel  to  the  growth  direction  with  modulus 
equal  to  the  distance  between  two  atomic  planes 
and  J?™||  is  a  vector  on  the  ra-th  atomic  plane.  For  a 
given  A ||,  the  eigenstates  E  are  calculated  by 
solving  the  secular  equation  (H+VH)\E,k\\) 
=  E\E,  Ay)  where  H  is  the  system  tight-binding 
hamiltonian  and  VH  is  the  Hartree  potential.  In 


order  to  calculate  electronic  and  optical  properties 
for  real  heterosystems,  the  presence  and  possible 
rearrangement  of  free  charges  has  to  be  taken  into 
account.  The  influence  of  the  electronic  charge 
rearrangement  can  be  included  at  a  Hartree  level 
by  solving  the  Poisson  equation  for  the  Hartree 
potential,  d2VHjdz 2  =  —p(z)/  e,  where  e  is  the 
static  dielectric  constant.  The  charge  density  p(z)  is 
defined  by  the  square  of  the  wave  function 
averaged  over  a  layer  z  and  weighted  by  the  Fermi 
function  f(E): 

^(Z)  =  /0  \2  [  ^11  l^Mii  1  f{En),  (3) 
(2?r)  J  n 

where  e  is  the  electron  charge  and  n  labels  the 
energy  levels  for  a  given  Ay.  A  full  Ay  integration  is 
performed  in  the  2D  Brillouin  zone  by  using  the 
special  A-points  technique  in  the  irreducible  wedge 
[12].  The  convergence  of  this  integration  has  been 
obtained  by  using  5  special  points  with  |A|||  <  0.06 
2irla  for  direct  band  gap  material.  To  achieve  self- 
consistency  in  indirect  band  gap  material  we  use  8 
points  with  |A|j|  <  0.2  27r/a. 

The  Poisson  and  Schrddinger  equations  in  the 
tight-binding  representation  are  iteratively  solved 
until  convergence  is  reached.  An  open-chain 
(infinite  well)  boundary  condition  is  used  for 
Schrodinger’s  equation.  In  order  to  avoid  influ¬ 
ences  on  calculated  electronic  levels,  boundaries 
are  chosen  far  away  from  the  nanostructure  region 
where  the  density  of  conduction  electrons  or 
valence  holes  is  high.  Although  a  better  choice 
for  the  boundary  condition  is  provided  by  complex 
band  structure  states  as  explained  in  Ref.  [11],  for 
all  the  situations  discussed  here  the  open  chain 
condition  represents  a  valid  and  simple  choice.  In 
order  to  speed  up  the  self-consistent  algorithm  we 
have  introduced  a  hybrid  method  to  diagonalize 
the  tight-binding  hamiltonian  which  uses  a  stan¬ 
dard  (LAPACK  [13])  routine  to  calculate  eigen¬ 
values  and  an  inverse  iteration  scheme  to  calculate 
eigenvectors.  The  initial  conditions  for  Schrodin- 
ger  and  Poisson  equations  are  obtained  self 
consistently  in  the  effective  mass  approximation. 
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When  optical  properties  are  of  interest,  one  can 
make  use  of  the  Kubo  formula  to  define  the 
susceptibility  tensor  which  is  related  to  the  current- 
current  response  function  of  the  electromagnetic 
perturbation  [10, 14]. 

3.  RESULTS 

In  order  to  show  some  results  obtained  with  TB 
coupled  with  carrier  redistribution,  we  consider 
several  HEMT-like  structures. 

If  we  consider  a  typical  device  with  a  430  A  n- 
doped  (n  =  1018  cm-3)  cap  layer,  20  A  undoped 
spacer  in  Alo.3Gao.7As,  100  A  GaAs  2D  channel 
and  Alo.3Gao.7As  buffer,  TB  shows  small  improve¬ 
ments  in  band  profile,  charge  density  and  energy 
levels  with  respect  to  EFA.  However,  a  more 
critical  situation  is  obtained  when  higher  A1 
concentrations  are  considered  in  the  cap  layer, 
which  bring  the  barrier  region  to  be  almost 
indirect  gap  (e.g.,  Er  =  Ex)-  Thus,  we  consider  in 
Figure  1  a  HEMT  similar  to  the  previous  one,  but 
with  Alo.45Gao.55As  as  cap  and  spacer  layer 
material.  It  becomes  evident  (see  Fig.  2)  that 
EFA  completely  fails  in  describing  the  charge 
redistribution  between  the  2D  channel  and  the 
parasitic  channel  which  forms  due  to  the  large  X- 
valley  contribution  to  the  charge  density. 


FIGURE  1  Self  consistent  band  profile  of  a  Alo.45Gao.55As/ 
GaAs/Alo.3Gao.7As  HEMT  structure 


FIGURE  2  Self  consistent  charge  density  of  the  HEMT  of 
Figure  1 


The  second  example  refers  to  a  pseudomorphic 
(PM)  structure  consisting  of  a  360  A  n-doped 
(n  =  1018  cm-3)  Al0.2Ga0.8As,  20  A  undoped 
Alo.2Gao.3As,  a  265  A  strained  well  of  In0.i5- 
Ga0.85As  and  a  GaAs  substrate.  Such  a  modula¬ 
tion-doped  structure  is  typical  for  PM  HEMT’s,  a 
device  of  great  importance  in  the  microwave  field 
[15].  In  this  calculation  the  background  doping 
charge  is  simply  added  to  the  free  charge  as  given 
by  Eq.  (3).  The  self-consistent  potential  profile  of 
the  structure  for  two  applied  gate  voltages  of  —0.6 
and  0.0  V  respectively  is  shown  in  Figures  3a  and 
3b,  respectively.  Strain  effects  in  the  InGaAs 
region,  which  are  taken  into  account  by  scaling 
the  hamiltonian  matrix  elements  [16],  split  the  HH 
and  LH  valence  bands.  Here  the  labels  HH  and 
LH  refer  to  the  character  of  the  valence  band  in 
the  growth  direction.  In  the  parallel  direction  the 
uppermost  valence  band  has  a  light  hole  character 
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FIGURE  3  Self  consistent  band  edge  profile  of  the  Al0.2- 
Ga0.8As/In0.i5Ga0.85As/GaAs  pseudo-morphic  structure  at  ap¬ 
plied  gate  voltages  of  —0.6  V  (a)  and  0.0  V  (b).  The  depth  is 
measured  in  monolayers  [ML]. 
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while  the  other  a  heavy  hole  one.  Strain  also 
induces  large  changes  on  the  effective  masses  of  the 
valence  bands.  For  the  unstrained  situation  we 
have  mHH  =  0.4475  m0  and  mLH  =  0.06481  m0 , 
while  for  bulk  Ino.15Gao.g5As  strained  on  GaAs  the 
effective  masses  are  m##  [001]  =  0.4328  m0 ,  mLH 
[001]  =  0.09596  /wo,  m^lOO]  =  0.08599  m0  and 
mHH  [100]  =  0.1391  m0.  In  the  confined  system,  the 
hole  energy  levels  present  a  different  in-plane  mass 
which  depends  on  the  energy.  For  an  applied 
potential  of  -0.6  V  the  first  three  hole  levels 
effective  masses  are  mA1[100]  =  0.10092  m0 ,  mh2 
[100]  =  0.14799  mo,  m/z3  [100]  =  0.2421  m0,  respec¬ 
tively.  We  observe  that  carrier  confinement  in¬ 
duces  an  enhancement  of  the  effective  mass 
compared  to  strained  bulk  material,  a  consequence 
of  band  non-parabolicity. 

The  calculated  photoluminescence  spectra  for 
several  applied  potentials  are  shown  in  Figure  4. 
Since  the  HH  quantized  levels  are  now  above  the 
LH  states,  the  luminescence  transitions  occur 
between  conduction  levels  and  the  heavy  hole 
levels.  For  an  applied  voltage  of  -0.6  V,  which 
corresponds  to  a  nearly  fiat  InGaAs  band,  the 
main  luminescence  peaks  are  related  to  the 
Ei  — >  HHi  and  E2  — >  HH2  transitions.  This  is  in 
agreement  with  the  selection  rules  Anm  =  0, 
where  n  labels  the  conduction  states  and  m  the 
valence  states  [2].  The  emitted  light  is  mainly 


polarized  in  the  in-plane  direction  since  the  levels 
have  essentially  a  heavy-hole  character.  When  the 
applied  potential  is  reduced,  two  other  peaks  form 
which  correspond  to  the  Ex  — >  HH2  and  E2^> 
HHX  transitions.  Indeed,  by  decreasing  the  applied 
potential,  the  increasing  channel  electron  density 
induces  a  sizable  band  bending  which  is  respon¬ 
sible  for  the  loss  of  symmetry  (see  Fig.  3b)  and  the 
transitions  EX-*HH2  and  E2^>  HHX  are  no 
longer  forbidden.  We  also  notice  the  presence  of 
a  red  shift  (quantum  Stark  shift)  of  the  Ex  — ►  HHX 
transition  due  to  the  presence  of  an  electric  field  as 
the  applied  potential  reduces  from  -0.6  to  0.0  V. 

As  final  example  we  show  in  Figure  5  the 
calculated  band  profile  (a)  and  charge  density  (b) 
of  a  /7-type  SiGe/Si  MODFET.  Strain  is  respon¬ 
sible  for  the  splitting  of  heavy  and  light  hole  bands 
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FIGURE  4  Photoluminescence  spectra  of  the  pseudomorphic  FIGURE  5  Self  consistent  band  profile  (a)  and  charge  density 

structure  for  several  applied  voltages.  The  dashed  lines  follow  (b)  of  a  Si0.98Geo.o2/Sio.7Ge0.3/Si  p-channel  MODFET  struc- 

the  different  luminescence  peaks.  ture. 
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as  in  the  previous  case.  The  charge  density  shows  a 
strong  difference  between  TB  and  EFA.  This  is  is 
mainly  due  to  the  high  non-parabolicity  and 
anisotropy  of  the  valence  band. 

In  conclusion,  we  have  shown  that  a  self- 
consistent  tight-binding  approach  can  be  used  to 
evaluate  the  electronic  structure  and  optical 
properties  of  semiconductor  nanostructures.  This 
represents  a  further  step,  with  respect  to  the 
envelope  function  model,  towards  an  at  initio 
calculation  of  such  properties. 
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Transient  Phenomena  in  High  Speed  Bipolar  Devices 
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A  new  numerical  method  is  applied  to  the  analysis  of  the  charge  partitioning  in  the 
quasi-neutral  base  of  a  BJT.  The  results  show  that  the  conventional,  1 : 2  collector/ 
emitter  partitioning  is  not  valid  in  general.  High  level  injection  increases  the  collector 
fraction,  whilst  fast  switching  decreases  it. 


Keywords:  BJT  capacitance,  transient  semiconductor  device  simulation 


1.  INTRODUCTION 

As  shown  in  [1],  the  charge  partitioning  may 
significantly  affect  device  delay  times.  Non-Quasi- 
Static  (NQS)  BJT  modeling  [2]  also  demonstrates 
the  practical  importance  of  such  effects  in  modern 
devices.  The  ever  increasing  speed  of  microelec¬ 
tronic  devices  [3,  4,  7]  requires  the  development  of 
more  accurate  models  to  describe  transient  beha¬ 
vior. 

Papers  [1,  2]  are  based  on  “first  order”  NQS 
corrections.  Although  the  latter  approaches  are 
significantly  different,  they  are  similar  in  their  use 
of  the  quasi-static  approximation  for  the  charge 
partitioning  factor  in  the  quasi-neutral  base 
(QNB).  The  accuracy  of  the  NQS  method  [2]  is 
higher  since  it  takes  into  account  a  delay  in  the 
QNB  charge,  not  only  in  the  collector  current. 


In  the  present  paper  we  use  an  exact,  2D 
numerical  technique,  for  the  evaluation  of  the 
charge  partitioning  in  semiconductor  devices  [5].  It 
is  applied  here  to  study  the  partitioning  of  the 
QNB  charge  (related  to  the  diffusion  capacitances 
of  a  BJT).  The  numerical  method  allows  the 
separation  of  the  charges  into  quasi-neutral  and 
space  charge  components,  and  the  extraction  of 
the  transient  components  of  terminal  currents, 
responsible  for  changes  in  the  device  internal 
charge. 

Previous  results,  obtained  for  a  ID pn- diode  [5], 
demonstrated  a  dramatic  change  of  the  charge 
partitioning  of  the  charge  injected  into  the  quasi¬ 
neutral  region  depending  on  the  ramp  speed.  It 
was  shown  that  the  injection  level  strongly 
influenced  the  charge  partitioning  which  could 
exceed  0.6  of  the  net  base  charge  in  very  high  level 
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injection.  In  contrast,  in  low  level  injection,  a 
partitioning  of  0.33  is  obtained  which  agrees  well 
with  the  conventional  diffusion  theory.  A  similar 
trend  was  noticed  in  [2]  where  small-signal  ID 
analysis  of  a  BJT  was  performed  (am  parameter  in 
Fig.  4,  [2]).  Charge-based  models  e.g.,  those 
stemming  from  [8,  9]  are  preferred  to  describe 
transient  device  behavior  and  there  has  been 
renewed  interest  in  charge  partitioning  [1,  10-21]. 

The  majority  of  authors  [1,10-21]  have 
addressed  the  partitioning  problem  using  simpli¬ 
fied  analytic  methods.  Analytic  models,  such  as 
those  proposed  for  MOSFETs  [10,  11]  and  BJTs 
[12,  13,  15,  16,  18,  19]  provide  charge  conserving, 
partitioned  charge  descriptions.  Analytic  models, 
based  on  first  or  higher  order,  non-quasi-static, 
solutions  of  the  continuity  and  transport  equations 
have  also  been  proposed  [12,  17,  20,  21].  To  obtain 
closed  form  expressions,  useful  for  device  models, 
such  analysis  requires  a  significant  number  of 
simplifying  assumptions. 

A  numerical  techniques  needed  to  extract  the 
various  charge/current  components,  necessary  to 
accurately  characterize  the  charge  partitioning, 
have  been  developed  recently  [5].  These  techniques 
were  implemented  in  TRASIM  [22]  which  provides 
an  accurate,  stable,  and  rapidly  convergent,  time- 
dependent,  numerical  solution  of  the  two-dimen¬ 
sional  Poisson  and  continuity  equations,  for 
specified,  time  dependent  boundary  conditions. 
The  numerical  results  show  that  there  are  sig¬ 
nificant  differences  between  the  exact  charge 
partitioning  and  the  predictions  of  the  analytic 
models. 

Both  the  present  and  previous  analytic  models  are 
based  on  the  conventional,  drift-diffusion  model  for 
carrier  transport.  The  numerical  analysis  confirms 
the  validity  of  this  approach  in  BJTs,  since  the 
electric  fields  (or  Fermi-level  gradients)  remain 
modest,  at  even  the  highest  switching  speeds. 

Non-the-less,  the  Ohmic  electric  fields,  asso¬ 
ciated  with  the  removal  of  the  neutralizing, 
majority  carriers,  are  large  enough  to  cause  major 
changes  in  the  minority  carrier  partitioning.  Such 
effects  are  completely  omitted  in  analytic  models. 


2.  DEPLETION  AND  DIFFUSION  CHARGES 

Conventional  device  models  associate  the  motion 
of  the  space-charge  region  boundaries  with  charge 
storage  in  a  “depletion  capacitance”,  and  the  stor¬ 
age  of  minority  carriers  in  quasi-neutral  regions 
with  a  “diffusion”  capacitance.  A  conventional 
numerical  model  provides  no  such  distinction.  The 
total  charge  for  each  carrier  type  must  be  split  into 
a)  a  space-charge  part,  corresponding  to  a  deple¬ 
tion  capacitance  and  b)  a  quasi-neutral  part, 
corresponding  to  a  diffusion  capacitance. 

The  extraction  of  the  space  charge  and  quasi¬ 
neutral  charge  have  been  described  in  detail  earlier 
[5].  Following  the  methods  described  in  [5]  the 
depletion  and  diffusion  charges  are  found  from 

QnVp  =  jQ  Pn,P  dxdy>  QZ  =  jQ  ftp  ***>  0) 

where  ft  is  the  computational  domain.  The  charges 
defined  in  Eq.  (1)  represent  only  the  changes  in  the 
device  charges/carrier  concentrations  that  occur  as 
a  result  of  the  applied  switching  ramp. 

3.  THE  PASS-THROUGH  CURRENT 

It  is  necessary  to  compute  the  “pass-through” 
current  [5],  defined  as  the  current  that  would  flow 
across  the  device  if  the  instantaneous  bias  and  the 
instantaneous,  transient  internal  potential  distri¬ 
bution  were  held  constant  (c.f.  “convective  cur¬ 
rent”  in  [23]).  The  time  dependent  pass-through 
current  must  be  subtracted  from  the  terminal 
currents  since  it  does  not  contribute  to  changes  in 
the  stored  charges. 

The  “pass-through”  current  Jp„  and  Jpt  are 
computed  in  a  separate  series  of  iterations,  solving 
only  the  steady  state,  non-linear  continuity  equa¬ 
tions 

VJP'  =  qR ;  V/f  =  -qR.  (2) 

During  a  fast  transient,  or  as  a  result  of  high  level 
injection,  the  electrostatic  and  majority  carrier 
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quasi-Fermi  potential  distributions  -0  (x,  y)  and 
(fpix,  y)  change  during  the  transition.  It  is  insuffi¬ 
cient  to  merely  interrupt  the  changes  in  terminal 
voltages  and  to  then  compute  the  conventional 
steady-state  current. 

For  even  a  very  slow  transient  errors  arise: 
Although  the  change  in  the  potential  distribution 
will  be  smaller,  the  integration  time  is  increasing 
and  still  leads  to  an  erroneous  calculation  of  the 
partitioning.  Unfortunately  the  effects  of  internal 
potential  variation,  imply  also  that  the  conven¬ 
tional  “steady-state”  part  of  a  circuit  model  is  no 
longer  adequate  to  predict  the  corresponding 
terminal  currents. 

Integration  of  the  solutions  of  Eq.  (2)  over  the 
device  contacts,  gives  Ylk=\  The  total 

charge,  arriving  at  the  A;-th  terminal  is  defined  as 

Qk(t)  =  -  jp:k(n)dt'. 

4.  SIMULATION  RESULTS 

The  modifications  described  above  were  imple¬ 
mented  in  TRASIM  [22],  and  the  modified  version 
of  TRASIM  used  for  transient  2D  numerical 
simulation  of  a  CML  BJT  [4].  Depending  on  the 
functional  purpose  of  the  transistor  and  the  desire 
to  increase  speed  of  the  ECL  circuit,  the  transistor 
may  operate  in  a  high  current  mode,  causing  high 
level  injection.  Rather  few  papers  have  been 
published  on  the  high  injection  behavior  of  the 
BJT  (e.g.  [6]).  We  believe  that  only  thorough 
numerical  investigation  will  shed  light  on  the 
highly  nonlinear,  transient  problem. 

4.1.  Device  Structure 

The  transistor  structure  is  shown  in  Figure  1  (we 
follow  the  data  and  SEM  photograph  from  [4]). 
Only  one  half  of  the  transistor  is  simulated  to 
reduce  the  computational  burden.  The  depth  of 
the  buried  layer  was  0.575  pm,  having  Gaussian 
impurity  distribution  with  exponent  of  0.1  pm  and 
peak  concentration  of  1019cm-3.  The  selectively 
epitaxially  grown  active  collector  has  impurity 


concentration  of  1.8  •  1017cm-3.  The  epitaxial  base 
layer  has  a  concentration  of  2.5  1018cm-3  and 
width  of  70  nm.  The  polysilicon  emitter  was 
modeled  by  a  surface  recombination  104cm/s, 
effective  p-n  junction  depth  of  25  nm  and  a  half 
emitter  with  and  area  of  0.175  pm  and  0.35  x  2.6 
pm2  respectively.  SRH,  Auger  and  high  doping 
effects  were  taken  into  account.  This  device  shows 
/3  about  95  and  Ic  of  1mA  at  V\B  =  -0.92 
and  FCb  =  2V  which  matches  the  experimental 
data  [4]. 

4.2.  Charge  Partitioning  for  Different 
Injection  Levels 

The  transients  were  simulated  at  VCb  =  2V  with 
Feb  switched  from  F|B  to  0  V  with  a  linear  ramp 
of  duration  rr  seconds.  The  collector  charge 
fraction  Pc  was  calculated  from  Pc  ~  Qc(t)l 
(Qc(0  +  Qe(0)>  where  QE(t)  and  Qc{t)  are 
charges  recaptured  at  the  emitter  and  collector 
terminals  respectively  after  time  t.  Figure  2  shows 
the  time  evolution  of  the  dynamical  charge 
partitioning  of  the  QNB  charge.  As  we  see, 
increasing  the  injection  level  leads  to  a  bigger 
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Time  (nanoseconds) 

FIGURE  2  Charge  partitioning  for  different  injection  levels 
rr  —  0.01  ns. 
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FIGURE  3  Electron  density  profile  across  the  current  flow  in 
the  QNB. 


collector  charge  fraction.  This  effect  is  due  to  the 
built-in  electric  field  in  the  QNB  associated  with 
the  high  injection  level  and  the  extraction  of  the 
compensating  majority  carriers. 

Figure  3  shows  2D  effects  present  when  dischar¬ 
ging  the  base  of  the  transistor,  in  low  injection. 
The  vertical  dashed  line  in  the  Figure  3  shows  the 
“current  channel  boundary”.  Electrons  injected 
outside  this  channel  are  diverted  to  the  collector 
rather  than  to  the  emitter.  After  switching-off  the 
transistor  they  are  collected  by  the  collector 
effectively  increasing  the  collector  fraction. 

4.3.  Charge  Partitioning  for  Different 
Ramp  Speeds 

Figure  4  shows  the  dynamical  charge  partitioning 
of  the  QNB  charge  for  different  ramp  speeds  at 
Veb  =  -0.95  V.  Increasing  ramp  speed  leads  to 
smaller  collector  charge  fraction  which  is  consis¬ 
tent  with  the  earlier  results  for  a  p-n  diode  [5],  This 
effect  is  due  to  the  transient  electric  field  in  the 
QNB  caused  by  the  hole  current. 

Figure  5  shows  the  vertical  component  of  the 
electron  current  density  during  transient  for  a  slow 
and  fast  ramps  under  high  level  injection.  The 
lateral  electric  field,  caused  by  the  fast  base 
discharge,  pinches  the  electron  current  flowing 
from  the  emitter  to  the  collector,  leading  to  a  non- 


FIGURE  4  Charge  partitioning  for  different  ramps,  V  cB  = 
-0.95  V. 
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FIGURE  5  Vertical  current  density  component  for  a  slow  and 
fast  transient. 
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monotonic  dependence  of  the  electron  current 
density  underneath  the  center  of  the  emitter.  For 
the  slow  transients  the  current  density  profiles 
retain  their  shape  during  the  transient. 


CONCLUSIONS 

A  number  of  different  BJT  operating  conditions 
were  simulated.  Some  results  depend  on  the  device 
structure,  nevertheless  a  few  general  conclusions 
seem  to  be  possible  at  this  point: 


i)  At  low  level  injection  charge  partitioning 
deviates  from  the  theoretical  1:2  collector:base 
partition  due  to  the  2D  effects. 

ii)  In  a  high  current  mode,  when  high  injection 
occurs,  the  resulting  electric  built-in  field  the 
QNB  leads  to  a  collector  charge  partition  that 
grows  with  VBE . 

iii)  The  ramp  speed  affects  charge  partitioning  in 
both  high  and  low  injection  level  by  effectively 
reducing  the  collector  charge  fraction  at  faster 
ramps. 

It  is  not  possible  draw  comparisons  with  [6] 
which  models  a  device  of  base  width  200  microns 
and  zero  recombination. 
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We  report  the  theoretical  investigation  of  the  phenomenon  of  the  formation  of  patterns 
transverse  to  the  tunneling  current  in  resonant  tunneling  double-barrier  heterostructures 
in  the  case  of  wide  range  of  bistable  voltages.  In  contrast  to  the  case  of  the  patterns  in 
the  structures  with  small  region  of  bistability,  for  pronounced  bistability  electron  lateral 
transport  is  strongly  nonlocal.  We  performed  numerical  simulations  of  the  stationary 
and  mobile  patterns  using  special  variational  procedure.  Our  results  revealed  that 
though  the  possible  types  of  patterns  remains  the  same  as  for  the  structures  with  small 
bistability  region,  their  characteristics  are  modified  considerably. 


Keywords:  Resonant  tunneling,  bistability,  patterns,  ballistic  transport 


1.  INTRODUCTION 

It  is  well  established  that  the  resonant  tunneling  in 
the  double  barrier  heterostructures  (DBH)  is 
supplemented  by  the  dynamic  charge  accumula¬ 
tion  in  the  quantum  well.  This  charge  accumula¬ 
tion  is  particularly  substantial  in  the  case  of  the 
structure  with  asymmetrical  barriers.  The  impor¬ 
tant  effect,  induced  by  the  built-up  charge  is 
intrinsic  bistability  of  the  system  under  considera¬ 
tion.  For  some  range  of  biases  two  stable  states 
exist  at  the  same  bias.  One  state  is  characterized  by 
a  large  built-up  charge,  resonant  tunneling  condi¬ 
tions  and  a  large  current,  the  other  one  corres¬ 


ponds  to  resonance  breaking,  lowering  of  the 
quasi-bound  state  below  the  bottom  of  the  emitter 
band  and  a  low  charge  built-up  and  current.  The 
effect  of  bistability  transforms  the  shape  of  the 
current-voltage  characteristic  of  the  resonant 
tunneling  diode  from  N-type  to  Z-type.  The  effect 
of  bistability  was  experimentally  observed  in 
[1-3].  Theoretical  investigations  of  the  bistability 
were  performed  in  [4-6].  In  these  papers,  the 
tunneling  was  considered  as  one-dimensional  and 
the  transport  through  the  DBH  was  supposed  to 
be  dependent  on  only  one  coordinate,  perpendi¬ 
cular  to  the  barriers.  Actually,  most  of  the  DBH 
are  layered  ones  and  a  tunneling  electron  moves 
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not  only  across  the  layers  (vertical  transport),  but 
also  along  these  layers  (horizontal,  or  lateral, 
transport).  In  previous  works  [7,  8]  we  have  shown 
that  under  the  bistable  conditions  not  only 
laterally  uniform,  but  also  nonuniform  configura¬ 
tions  of  built-up  charge  and  tunneling  current 
(patterns)  can  exist.  Results  of  [7,  8]  were 
applicable  mainly  to  DBHs  with  a  small  range  of 
bistable  voltages,  where  local  approach  for  elec¬ 
tron  lateral  transport  was  applied.  In  this  work  we 
present  results  of  numerical  simulations  of  sta¬ 
tionary  and  mobile  patterns  for  the  DBHs  with 
wide  voltage  range  of  bistability,  where  electron 
lateral  transport  is  ballistic  and  strongly  nonlocal. 

2.  MODEL  AND  BASIC  EQUATIONS 

Since  the  problem  of  the  transverse  patterns 
requires  at  least  a  two  dimensional  spatial  analysis, 
we  use  a  simple  model,  showing  the  main  features 
of  the  bistability  and  the  patterns.  We  deal  with 
the  model  of  a  resonant  tunneling  heterostructure, 
schematically  shown  in  Figure  1.  The  structure  is 
treated  as  a  system  of  three  parts,  weakly  coupled 
by  tunneling:  emitter  ( E ),  quantum  well  (QW)  and 
collector  (C).  The  electrodes  E  and  C  are  usually 
heavy  doped  semiconductors  and  are  supposed  to 


E 
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QW 
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FIGURE  1  The  scheme  of  the  resonant  tunneling  structure 
and  its  energy  band  diagram  under  bias. 


be  ideal  conductors.  The  energy  height  of  the 
barriers  Bu  B2  is  V  and  their  thicknesses  are 
dsx,  ds2 ,  respectively.  Charge  accumulation  in  the 
well  causes  a  change  of  the  potential  profile  in  the 
whole  structure.  It  alters  the  position  of  the  quasi¬ 
bound  state  with  respect  to  the  bottom  of  the  energy 
band  of  the  emitter  and,  in  general,  with  respect  to 
the  bottom  of  the  quantum  well.  We  disregard  the 
latter  effect  and  consider,  that  the  built-up  charge 
shifts  equally  the  well  bottom  and  the  quasi-bound 
level.  Such  a  case  corresponds  to  the  very  thin 
quantum  well,  where  the  built-up  charge  can  be 
accounted  for  as  an  infinitely  thin  sheet. 

The  horizontal  electron  transfer  is  the  main 
process,  determining  the  transverse  patterns.  This 
transfer  can  be  thought  as  follows.  The  electron  is 
injected  from  the  emitter  to  the  well  in  general  with 
a  finite  horizontal  component  of  the  momentum  p 
or  velocity  v=p/m *  (m*  is  effective  mass).  The 
velocity  depends  on  the  position  of  the  quasi¬ 
bound  state  energy  with  respect  to  the  Fermi 
energy  EF  in  the  emitter:  when  the  energy  quasi¬ 
bound  level,  e0,  moves  from  EF  through  the 
bottom  of  the  emitter  band  Eq>,  the  velocity 
changes  from  zero  to  the  Fermi  velocity 
vF  =  y/2EF/m*.  For  estimates,  one  can  account 
that  v  and  vF  have  the  same  order  of  magnitude. 
We  can  introduce  the  characteristic  time  for 
horizontal  transfer:  time  of  tunneling  escape  from 
the  well  Tes.  As  we  show  below,  the  existence  of  the 
developed  intrinsic  bistability  is  strongly  related 
with  the  character  of  electron  lateral  transport. 
Namely,  the  time  of  electron  tunneling  escape 
from  the  QW  should  be  smaller  than  the  time  of 
scattering  on  phonons,  impurities,  etc.  It  means, 
that  between  the  tunneling  events  electron  moves 
in  the  Q  W  ballistically .  As  a  result,  the  horizontal 
characteristic  distance  is  Lch  =  vres.  One  can  expect 
that  the  scale  of  the  patterns  in  question  is  of  the 
order  of  Lch. 

For  narrow  resonant  level  from  the  uncertainty 
relation  we  can  write  sores  %.  Combining  this 
inequality  with  the  fact,  that  EF,  e0  and  the  kinetic 
energy  of  the  horizontal  motion  ra*v2/ 2  are  of  the 
same  order  of  magnitude,  we  find  for  the  in-plane 
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wave  vector  k : 

n  jfi* 

k Lch  =  -Lch  =  — res  ~  e0  res  >  1.  (1) 

The  latter  estimate  shows,  that  horizontal 
transfer  can  be  considered  as  classical ,  while  the 
vertical  transport  should  be  treated  as  quantum.  As 
a  result,  we  can  introduce  the  distribution  function 
of  the  resonant  electrons  in  the  QW.  Assuming 
weak  tunneling  coupling  between  E,  QW  and  C, 
we  derive  the  kinetic  equation  for  the  distribution 
function  / : 


uniform  tunneling  in  the  x,  y  plane.  In  such  a  case 
the  r,  t  dependences  are  absent  and  from  kinetic 
Eq.  (2)  one  can  find  the  areal  electron  concentra¬ 
tion 

n(4>)  =  n0(<l>)  =  Tes(<t> )  ^2  t),P)-  (4) 

P 

Since  the  left-hand  side  of  (4)  is  a  function  of  </>, 
we  get  two  algebraic  Eqs.  (3)  and  (4)  for  two 
variables  n ,  </>.  It  is  convenient  to  rewrite  this 
system  as 


df  p  df  d(j)  df  ,  / 


+  /{/} 


(2) 


where  <p(r )  is  the  electrostatic  potential  energy  in 
the  well,  G((j)(r ,  t),  p )  is  the  local  rate  of  tunneling 
from  the  emitter  to  the  well,  res  is  the  tunneling 
escape  time,  /{/}  is  the  collision  integral  for  the 
electrons  inside  the  well,  G  and  res  are  functions  of 
cj)  at  fixed  r.  They  are  expressed  through  the 
tunneling  probabilities  and  the  Fermi  distribution 
of  electrons  in  the  emitter. 

From  the  uncertainty  condition  (1)  we  can 
deduce,  that  the  characteristical  scale  Lch  greatly 
exceeds  the  well  width.  We  assume  Lc h  is  much 
larger  than  the  thickness  of  the  DBH:  Lch  >  d.  In 
this  case  one  can  use  the  quasi-local  relation 
between  (j>  and  electron  density  n  =  %/(?,  p ,  t): 

<3) 

which  can  be  derived  from  the  Poisson  equation. 
Here  k  is  dielectric  permittivity,  $  is  the  external 
bias  in  the  energetical  units.  Eqs.  (2),  (3)  compose 
the  system  of  coupled  nonlinear  equations  which 
describe  the  system  under  consideration. 


3.  BISTABILITY  IN  THE  UNIFORM 
STRUCTURE 

Let  us  show,  that  the  model  formulated  above 
allows  the  bistable  vertical  transport  regimes  with 


(5) 

For  the  particular  heterostructure  the  latter 
equation  has  one  controlling  parameter,  external 
bias  $. 

We  have  calculated  the  functions  res(0)  and  g{4>) 
for  the  heterostructure  with  parameters:  V=  1  eV, 
m*  =0.067  m0,  d^S.l  nm,  dB]  =  2nm,  £0  =  0.1  eV, 
k=  11.5.  In  Figure  2  the  left  and  right-hand  sides 
are  shown  for  £>=56  meV  and  zero  temperature. 
The  dependence  of  L  on  $  is  weak  and  only  one 
curve  L  (</>)  is  shown.  Qualitatively  the  dependence 
L((j>)  corresponds  to  the  switching  on  of  the 
resonant  tunneling  when  the  resonant  level  crosses 
the  bottom  of  emitter  conduction  band  and  further 
gradual  decrease  of  the  number  of  resonant 
electrons  in  the  emitter  while  the  resonant  level  is 
shifted  toward  the  Fermi  level  of  the  emitter.  R{<j>) 
is  just  a  straight  line  whose  vertical  shift  is 
determined  by  the  bias.  From  Figure  2  one  can 
see  that  our  system  possess  the  property  of 
bistability  in  the  particular  range  of  biases 
Corresponding  current- voltage  char¬ 
acteristic  is  shown  in  the  insert  of  Figure  2. 

The  approach,  used  in  Eq.  (2)  allows  to  include 
into  the  tunneling  generation  rate  G  the  effect  of 
scattering  of  the  resonant  electrons  in  the  QW  (see 
[9]).  The  analysis  of  this  effect  has  shown  that  the 
wide  range  of  the  bistability  can  take  place  if 
<7~ ew  ^  ^ scT cw/  (t cw'T- sc) •  Here  Tew,  tcw  are  the  char- 
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FIGURE  2  Illustration  of  the  self-consistent  solutions  of 
the  bistability  problem  under  uniform  tunneling  for  the 
structure  described  in  the  text.  Curve  1  is  almost  independent 
on  the  external  bias  and  other  curves  are  R(<j))  for  two 
biases:  0.29  eV  (curve  2)  and  0.31  eV  (curve  3).  The  insert  shows 
the  current-voltage  characteristic.  L  and  R  are  in  units 
10ncm“2. 


acteristical  times  of  tunneling  from  the  emitter  to 
the  QW  and  from  the  QW  to  the  collector, 
calculated  for  the  electrons  with  the  energy  of 
vertical  motion  equal  to  the  Fermi  energy  in  the 
emitter,  and  rsc  is  the  characteristical  time  of 
scattering.  It  means,  that  the  bistable  range  of 
voltages  is  substantial  for  the  structure  with  high 
asymmetry  in  the  barriers  transmission  coefficients 
(the  collector  barrier  should  be  less  transparent) 
and  with  ballistic  or  quasi-ballistic  character  of 
lateral  electron  transport.  Furthermore,  this  fact 
explains  dependence  of  the  bistability  on  the 
temperature,  which  is  usually  observed  in  experi¬ 
ments:  at  higher  temperatures  rsc  decreases  and 
this  washes  out  the  asymmetry  of  tunneling  in  the 
system  and  the  effect  of  bistability. 


4.  PATTERNS  UNDER  BALLISTIC  REGIME 
OF  HORIZONTAL  TRANSFER 

We  consider  the  theory  of  the  one-dimensional 
patterns,  in  which  all  physical  values  (built-up 
charge,  tunneling  currents,  etc.)  depend  on  only 
one  coordinate,  namely  y .  Then,  we  assume 
completely  ballistic  lateral  electron  transport.  In 


this  case  (2)  becomes 

W=c-L  (a 

dt  m *  dy  dy  dp  res‘ 

here  p  labels  the  ^-component  of  momentum  p. 
One  can  solve  (6)  in  terms  of  the  characteristic 
curves 

P  =  ±  \Jp[+2m*  (^o)-^W)  =  V{p0,  Jo,  y), 

(7) 

where  p0  is  the  momentum  of  the  electron,  injected 
into  the  well  at  the  point  y=y$.  The  general 
solution  of  the  kinetic  equation  has  the  form 

fM)  =  l  (8) 

G{V(p,  j, /),/), 

where  the  kernel  M(p,y,y')  depends  on  the 
particular  shape  of  the  potential  (j){y )  and  can  be 
easily  calculated  in  the  explicit  form. 

Unfortunately,  iteration  methods  of  solution 
fail  in  the  solution  of  (3),  (8)  due  to  stability 
problems.  Because  of  that  we  have  applied  the 
following  variational  procedure  for  the  self-con¬ 
sistent  solution  of  (3),  (8).  We  introduce  the 
functional 


m  =  I  dy{<t>-C{<t>})\  (9) 

where 

=  ~df  $+47rg2^’ d*  m*  [ %MG • 

(10) 

Functional  J  equals  zero  for  the  exact  solution 
of  (3),  (8).  For  a  particular  solution  we  can  choose 
some  probe  functions  (f)pr(y,  ct),  where  ct  are 
variational  parameters.  These  parameters  are 
determined  by  the  condition  of  minimization  of 
J(Ci).  It  can  be  shown  that  our  variational 
formulation  of  the  problem  of  patterns  is  equiva¬ 
lent  to  the  initial  system  of  Eqs.  (3),  (6). 
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Using  this  method  we  have  analyzed  the 
possible  types  of  stationary  patterns.  The  possible 
types  of  patterns  are  the  same  as  for  the  case  of  the 
structures  with  small  range  of  bistable  biases.  The 
bistable  range  of  biases  can  be  split  into  the  two 
regions  $/<4><$c  and  $c<$<$h  with  the 
different  types  of  patterns.  In  the  first  region  the 
soliton-like  patterns  can  exist.  In  these  patterns  at 
large  \y\  the  system  is  in  the  low-current  state, 
while  in  the  certain  spatial  region  the  local  increase 
of  the  built-up  charge  and  tunneling  current  takes 
place.  At  4>c<4><<1>/2  the  possible  patterns  are 
anti-soliton-like  with  the  opposite  characteristics: 
at  high  \y\  the  system  is  in  the  high  current  state 
and  local  decrease  of  the  built-up  charge  and  the 
tunneling  current  in  the  finite  spatial  region.  The 
$  =  $>c  is  the  critical  bias:  in  this  case  the  special 
kink-like  pattern  occurs.  In  this  pattern  high  and 
low  current  states  coexist  in  the  different  spatial 
regions. 

Our  calculations  revealed  that  the  anti-soliton 
patterns  are  substantially  wider  than  soliton 
patterns.  This  is  because  for  soliton  pattern  in  the 
region  of  nonuniformity  tunneling  via  both  bar¬ 
riers  is  possible,  while  for  anti-soliton  only  tunnel¬ 
ing  via  collector  barrier  takes  place.  This  influence 
the  characteristical  length  which  an  electron  can 
pass  in  the  quantum  well,  and  consequently  the 
width  of  the  pattern.  The  degree  of  the  asymmetry 
in  the  spatial  scales  of  the  soliton  anti-soliton 
patterns  strongly  depends  on  the  degree  of 
asymmetry  of  the  transmission  coefficients  of 
emitter  and  collector  barriers.  To  avoid  numerical 
difficulties,  we  performed  numerical  simulations  of 
the  stationary  patterns  for  a  structure  with  thinner 
collector  barrier  with  respect  to  the  structure 
described  in  Section  3,  namely  with  ds2  =  3.4 nm. 
The  dependence  of  the  Ls  for  the  case  of  soliton- 
like  pattern  and  La  for  the  case  of  anti-soliton 
pattern  on  the  dimensionless  parameter 
q  =  $/)  is  presented  in  Figure  3. 

The  important  property  of  patterns  in  the  case 
of  pronounced  bistability  is  substantial  difference 
between  the  spatial  regions  of  localization  of 
nonuniformities  of  the  emitter -QW  and  the  QW- 


FIGURE  3  Dependence  of  the  width  of  the  soliton  pattern, 
Ls,  and  anti-soliton  pattern,  La ,  on  the  dimensionless  voltage 
parameter  q  =  ($-$/)/($/,-$/). 


collector  tunneling  currents.  This  is  due  to  i)  a 
strong  dependence  of  the  tunneling  injection  rate 
from  the  emitter  to  the  QW  on  the  position  of  the 
resonant  level  with  respect  to  the  bottom  of  the 
emitter  conduction  band  and  ii)  a  ballistic  leakage 
of  the  injected  electrons  over  the  QW  before  the 
tunneling  escape  to  the  contacts.  This  is  illustrated 
in  Figure  4.  In  the  upper  part  of  the  figure  the 
current  field  in  the  structure  is  shown  (spacing 
between  the  current  lines  is  proportional  to  the 
value  of  the  current  density).  In  the  lower  part  of 
the  figure  spatial  dependences  of  the  emitter -QW, 


FIGURE  4  The  spatial  dependence  of  the  current  for  the 
soliton-like  pattern  at  q  =  0.3.  In  the  upper  part  the  current  field 
is  depicted.  In  the  lower  part  the  emitter  (curve  1),  collector 
(curve  2,  multiplied  by  factor  5)  and  two-dimensional  lateral 
(curve  3)  currents  are  presented. 
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QW- collector  and  two-dimensional  lateral  cur¬ 
rents  are  depicted.  The  bias  at  which  calculations 
were  performed  corresponds  to  #  =  0.3. 

In  addition  to  the  stationary,  we  have  investi¬ 
gated  mobile  auto  wave  patterns.  In  fact,  they  are 
the  moving  kink-like  patterns,  describing  switch¬ 
ing  of  the  DBH  from  one  uniform  state  to  the 
other.  For  the  particular  value  of  the  bias  the 
unique  value  of  the  switching  wave  velocity  and 
the  type  of  the  switching  exist.  Namely,  for 
$/<<!><< E>c  the  switching  from  the  high  to  the 
low  current  state  is  possible,  while  for  $c  <  $  <  $h 
the  switching  from  the  high  to  the  low  current  state 
can  occur.  Due  to  the  ballistic  character  of  the 
horizontal  transport  the  characteristical  velocity  of 
the  switching  waves  is  determined  by  the  Fermi 
velocity  of  electrons  in  the  emitter  vF.  In  Figure  5 
the  dependence  of  the  switching  wave  velocity  (in 
units  Vf)  on  q  is  presented  for  the  structure 
described  in  Section  3.  The  insert  shows  the  spatial 
dependence  of  the  built-up  charge  in  the  stationary 
kink  at  3>  =  4>c.  The  positive  sign  of  velocity,  in 
accordance  with  mentioned  above,  corresponds  to 
the  switching  from  the  high  to  the  low  current 
state. 

Note,  that  (3)  is  obtained  for  the  infinitely 
conducting  emitter  and  collector.  For  autowaves, 
moving  with  the  velocity  of  the  order  of  vF  and 


FIGURE  5  The  dependences  of  the  switching  wave  velocity 
(in  units  vF)  on  dimensionless  voltage  parameter  ($-$/)/ 
($tj— $/).  The  spatial  dependence  of  the  built-up  charge  (in 
units  10ncm"2)  in  the  stationary  kink  is  shown  in  the  insert. 


spatial  scale  of  the  kink  transition  region  vFres , 
this  assumption  is  valid  for  the  structures,  in  which 
res  is  substantially  greater  than  the  characteristical 
time  of  relaxation  in  electrodes.  This  is  true  for  the 
heterostructures  with  thick  enough  barriers  and 
electrodes  with  high  conductivity.  Otherwise,  the 
characteristics  of  autowaves  (their  velocity,  for 
example)  can  be  modified  by  the  relaxation 
processes  in  the  electrodes. 


5.  SUMMARY 

In  a  conclusion,  we  have  investigated  the  phenom¬ 
enon  of  the  transverse  patterns  formation  in  the 
resonant  tunneling  double  barrier  diode  with 
the  wide  voltage  range  of  intrinsic  bistability  of 
the  current-voltage  characteristic.  These  patterns 
are  stationary  or  mobile  nonuniform  distributions 
of  the  built-up  charge  and  tunneling  current.  For 
the  pronounced  bistability  the  characteristics  of 
the  patterns  are  determined  by  the  ballistic  and 
nonlocal  character  of  lateral  transport  of  the 
resonant  electrons  in  the  QW.  This  fact  gives  rise 
to  specific  features  of  the  patterns  with  respect  to 
the  similar  phenomena  in  DBHs  with  a  small 
range  of  bistable  voltages.  For  numerical  simula¬ 
tions  a  special  variational  procedure  was  devel¬ 
oped. 
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Nonparabolicity  effects  in  two-dimensional  electron  systems  are  quantitatively 
analyzed.  A  formalism  has  been  developed  which  allows  to  incorporate  a  nonparabolic 
bulk  dispersion  relation  into  the  Schrodinger  equation.  As  a  consequence  of 
nonparabolicity  the  wave  functions  depend  on  the  in-plane  momentum.  Each  subband 
is  parametrized  by  its  energy,  effective  mass  and  a  subband  nonparabolicity  coefficient. 
The  formalism  is  implemented  in  a  one-dimensional  Schrodinger-Poisson  solver  which 
is  applicable  both  to  silicon  inversion  layers  and  heterostructures. 
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1.  INTRODUCTION 

To  accurately  model  the  high-field  transport  in 
silicon  inversion  layers,  several  authors  [3,  4]  have 
introduced  a  nonparabolicity  correction  in  the 
subband  dispersions.  In  this  work  we  quantita¬ 
tively  analyze  nonparabolicity  effects  in  various 
two-dimensional  electron  gases.  For  this  purpose  a 
self-consistent  Schrodinger-Poisson  solver  has 
been  developed,  capable  of  dealing  with  silicon 
inversion  layers  and  heterostructures.  For  hetero¬ 
structures,  position-dependent  material  para¬ 
meters  are  taken  into  account.  As  a  result  each 
subband  is  characterized  by  three  parameters,  E„, 
m„,  a„,  which  denote  the  subband  energy,  mass 


and  nonparabolicity  coefficient,  respectively.  This 
set  of  parameters  is  intended  to  serve  as  input  for 
high-field  transport  calculations. 

Our  approach  relies  on  the  effective-mass 
approximation  which  is  applicable  if  the  confining 
potential,  V(z),  satisfies  two  conditions  [3]: 

1 .  V(z)  is  slowly  varying  over  a  unit  cell, 

2.  matrix  elements  of  V(z)  between  Bloch  func¬ 
tions  of  different  bands  are  negligible. 

2.  SILICON  INVERSION  LAYERS 

Within  the  effective-mass  approximation  a  one¬ 
dimensional  Schrodinger  equation  can  be  derived 
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from  the  three-dimensional  one  provided  that  the 
potential  only  varies  in  one  dimension. 

(£(~i^K)  +  V(z))Uz\K)  =  En{K)Uz\K) 

(I) 

In  this  equation,  („  denotes  the  envelope  function, 
K  is  the  in-plane  wave  number,  and  E(K) 
represents  the  in-plane  dispersion  relation.  The 
bulk  dispersion  relation,  e(k),  is  assumed  to  have 
ellipsoids  as  equi-energy  surfaces,  e(k)  =  e{K2  / 
mxy  +  k2Jmz).  In  principle,  (1)  can  be  solved 
numerically  for  different  values  of  K  so  as  to 
obtain  a  point-wise  representation  of  E(K). 
However,  since  the  bulk  dispersion  is  given  by  an 
analytic  function  one  usually  is  interested  in 
obtaining  analytic  subband  dispersions  as  well. 
The  latter  can  be  found  by  applying  perturbation 
theory  at  K= 0.  The  kinetic  energy  operator  is 
expanded  into  a  Taylor  series,  and  terms  up  to  K4 
are  retained  in  order  to  get  information  on  the 
subband  nonparabolicity. 

/ K 2  1  d2\  „  „  ,  , 

The  unperturbed  problem  is  defined  by  T0,  and  the 
terms  containing  the  in-plane  wave  number  are 
considered  as  perturbation.  The  operators  Tt  are 
given  by: 


3.  HETEROSTRUCTURES 


tors.  The  usual  implicit  definition  of  the  kinetic 
energy  can  be  generalized  to  a  kinetic  energy 
operator,  even  when  the  parameters  are  position- 
dependent. 


T+Tot{z)T  = 


n2 


K2 

mxy  (z) 


9  i 

dzm2(z)  dz  ) 


(3) 


This  equation,  which  is  self-adjoint,  can  be  solved 
for  the  kinetic  energy  operator. 


T  = 

Go  = 

G,  = 
h(x)  = 


a~1/2h(Go  +  G\K2)oTxI2 

_al/2A_^_Aai/2 

dz  2  mz  dz 
a{z) 

2  mXy(z ) 

2x 

1  +  y/1  +  4x 


(4) 


In  analogy  with  (2)  the  kinetic  energy  operator  has 
to  be  decomposed  as  follows, 


T(Gq  +  Gj  K2)  «  n+T\K2+  T2K4 ,  (5) 

where  the  determination  of  the  operators  Tt  for 
heterostructures  is  more  complicated  than  for 
uniform  material  parameters  since  the  operators 
Go  and  G\  no  longer  commute. 


4.  SUBBAND  DISPERSION  RELATION 

The  eigenvalues  of  (1)  at  ^=0  are  denoted  by  E° 
and  the  eigenfunctions  by  C„°(z)  (n  =  0, 1, 2. . .).  The 
matrix  elements  of  Tx  and  J2  in  the  {^}  basis  are 
T\ j,m  and  T2x,m.  respectively.  Perturbation  theory 
yields  a  polynomial  representation  for  E„  ( K ): 

\miLn£'n  J 

(6) 


For  heterostructures  we  assume  nonparabolic  This  expression  allows  to  characterize  each  sub¬ 
dispersion  relations  for  the  different  semiconduc-  band  by  an  effective  mass,  m„,  and  a  nonparabo- 
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licity  coefficient,  an  defined  by 


Although  (6)  suggests  a  representation  of  En(K )  as 
a  polynomial  in  K  it  appears  favorable  to  assume  a 
second-order  polynomial  in  the  energy  instead. 

*2  js  2 

{E-E*n)(\+an{E-E*))=—  (9) 

By  comparison  with  En(K )  obtained  from  a 
solution  of  (1)  at  discrete  ^-points  it  was  found 
that  (9)  gives  an  excellent  approximation  for  En(K) 
in  a  wide  Grange,  whereas  the  polynomial  (6)  is 
applicable  only  for  sufficiently  small  K. 

5.  ELECTRON  DENSITY 

The  electron  density  is  given  as  the  sum  of  all 
position  probabilities  of  the  states  |«,  K)  weighted 
by  their  occupation  probabilities. 

«w=fvEEia^)i2/(W)  (io) 

n  K 

In  (10)  /  denotes  the  Fermi-Dirac  distribution 
function  and  gv  the  valley  degeneracy  factor.  The 
summation  over  K  is  usually  converted  to  an 
integral  by  employing  the  two-dimensional  density 
of  States,  p  2D,n* 

P2D,n(E)  —  £v— Tj(l  +  2an(E  —  E®))  (11) 

m 

poo 

«(z)  =  E/  \u^fO\2mp2DME)dE  (i2) 

The  hole  density  is  calculated  using  Boltzmann 
statistics.  Once  the  carrier  densities  are  known  the 
electrostatic  potential  is  obtained  by  solving  the 
one-dimensional  Poisson  equation.  During  each 


self-consistent  iteration  step  the  Schrodinger  equa¬ 
tion  and  the  linearized  Poisson  equation  are 
solved. 


6.  DISCRETIZATION 

The  wave  functions,  which  are  assumed  to  vanish 
at  the  boundaries,  are  represented  by  a  Fourier 
series.  After  truncation  of  the  series  the  Schrodin¬ 
ger  equation  (1)  is  converted  into  an  algebraic 
eigenvalue  equation  [1]. 

The  linearized  Poisson  equation  is  discretized  in 
real  space  using  the  finite  difference  method.  This 
yields  a  tridiagonal  equation  system  which  can  be 
efficiently  solved.  During  each  iteration  step  the 
Fast  Fourier  Transform  is  applied  to  transfer  the 
required  quantities  from  real-space  representation 
to  momentum  representation  and  vice  versa. 


7.  RESULTS  AND  DISCUSSION 

7.1.  Silicon  Inversion  Layer 

A  MOS  capacitor  has  been  simulated.  The 
parameters  were  chosen  as  follows:  W/-0.98, 
m,  =  0.19,  a  =  0.7eV_1,  tox  =  7nm,  JV^  =  5-1016 
cm-3,  VGB= 2.5V,  T=  300  K.  A  nonparabolic  bulk 
dispersion  relation  was  assumed: 

£= _ h. _  1=£(E.+a\ 

1  +  y/l  +  4cry 5  2  \mxy  mj 

(13) 

In  Figure  1  the  Fourier  coefficients  of  the  first  two 
wave  functions  are  plotted.  This  corresponds  to  a 
representation  of  the  wave  functions  in  momen¬ 
tum  space.  The  number  of  harmonics  equals 
N=  64.  Due  to  nonparabolicity  the  motion  of  the 
carriers  normal  to  the  interface  is  no  longer 
decoupled  from  the  motion  parallel  to  the  inter¬ 
face,  as  is  the  case  for  parabolic  bands.  For  non¬ 
vanishing  K  narrower  wave  functions  are  observed 
than  for  ^=0. 
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FIGURE  1  Spectrum  of  the  first  three  wave  functions  in  a 
silicon  inversion  layer. 

7.2.  InP-Based  Pseudomorphic  HEMT 

A  heterostructure  after  [2]  has  been  simulated 
(Tab.  I).  In  the  simulation  both  nonparabolic  and 
parabolic  bulk  dispersions  have  been  considered. 
When  moving  across  this  heterostructure  an 
electron  will  experience  considerable  variations  of 
the  bandstructure  parameters. 

The  subband  parameters  mn  and  an  are  plotted 
in  Figures  2  and  3  for  20  subbands.  Strong  varia¬ 
tions  of  these  parameters  can  be  observed.  In  the 
nonparabolic  case,  the  mass  increases  significantly 
when  going  from  subband  3  to  4,  while  in  turn  the 
nonparabolicity  coefficient  decreases.  This  means 
that  the  changes  of  mn  and  an  are  correlated.  This 
behavior  can  be  understood  when  considering  the 
wave  functions  (Fig.  4).  While  an  electron  in 
subband  3  resides  preferably  in  the  InGaAs 
channel  (low  mass,  high  a),  an  electron  in  subband 
4  feels  predominantly  the  material  properties  of 
the  AlInAs  barrier. 


TABLE  I  Parameters  of  the  InP  heterostructure.  The  donor 
layer  is  doped  with  A^-2.1018  cm-3 


Layer 

Material 

t 

(nm) 

mass 

(mo) 

a 

(eV-1) 

barrier 

Alo.48ln0.52A.S 

20 

0.082 

0.84 

donor 

AI().48lfl0.52  As 

12.5 

0.082 

0.84 

spacer 

Alo.48lflO.52As 

2 

0.082 

0.84 

channel 

IflO.53Gao.47As 

20 

0.038 

1.02 

substrate 

InP 

20 

0.077 

0.83 

FIGURE  2  The  subband  masses  for  the  InP  heterostructure 
for  both  the  nonparabolic  and  parabolic  case. 


FIGURE  3  The  subband  nonparabolicity  coefficients  for  the 
InP  heterostructure.  Although  for  the  lower  curve  parabolic 
bulk  dispersions  are  assumed,  the  subbands  are  nonparabolic 
due  to  the  material  inhomogeneity. 


FIGURE  4  Wavefunctions  3  and  4  in  the  InP  heterostruc¬ 
ture. 
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Physical  simulation  of  semiconductor  devices  at  high  frequencies  involves  not  only 
semiconductor  transport  issues  but  also  electromagnetic  wave  propagation  issues.  In 
order  to  obtain  the  nonlinear  and  the  large-signal  characteristics  of  the  semiconductor 
devices,  an  electromagnetic  model  should  replace  the  traditional  quasi-static  model  in 
the  device  simulator.  In  this  paper,  the  advantages  of  a  semiconductor  device  simulator 
combining  an  electromagnetic  and  an  electron  transport  models  are  presented.  This 
study  is  based  on  a  semiconductor  device  simulator  that  couples  a  semiconductor  model 
to  the  3D  time-domain  solution  of  Maxwell’s  equations.  The  electromagnetic  wave 
propagation  effects  on  the  millimeter-wave  FETs  are  thoroughly  analyzed.  The  use  of 
the  electromagnetic  model  over  the  conventional  quasi-static  model  provides  the  actual 
device  response  at  high  frequencies.  It  also  shows  the  nonlinear  energy  build-up  along 
the  device  width  whereas  the  quasi-static  model  provides  a  linear  increase  of  energy.  The 
combined  model  is  capable  of  predicting  the  device  nonlinearity  and  harmonic 
distortion  of  amplifier  circuits  at  large  signal. 


Keywords:  Device  simulation,  hydrodynamic  models,  FDTD,  full-wave  simulators 


1.  INTRODUCTION 

With  the  advancement  of  semiconductor  technol¬ 
ogy,  the  techniques  required  to  analyze,  design, 
and  optimize  the  semiconductor  devices  are  be¬ 
coming  increasingly  sophisticated.  The  computer 
simulation  programs  are  now  essential  tools  for 


device  engineers.  These  numerical  simulations 
based  on  physical  modeling  can  be  used  to  predict 
and  provide  better  understanding  of  the  device 
behavior.  However,  the  down-sizing  of  the  active 
device  dimensions  has  presented  new  challenges  to 
the  device  and  circuit  designer.  In  submicron 
semiconductor  devices,  several  new  transport 
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phenomena  develop  and,  consequently,  have  to  be 
considered  in  device  modeling.  The  electrons  do 
not  reach  equilibrium  transport  conditions  while 
traveling  along  the  conducting  channel.  To  in¬ 
corporate  the  effects  of  nonstationary  dynamics  in 
semiconductor  devices,  the  hydrodynamic  model 
based  on  moments  of  Boltzmann’s  transport 
equation  is  used  [1-3].  The  transport  parameters 
are  taken  as  functions  of  average  electron  energy 
rather  than  the  local  electric  field. 

The  device  modeling  at  high  frequencies  requires 
special  attention.  At  high  frequencies,  the  coupling 
between  the  electrons  and  the  propagating  electro¬ 
magnetic  waves  can  not  be  neglected  in  submicron 
devices.  The  short  period  of  the  propagating  EM 
wave  approaches  the  electron  relaxation  time  and 
as  the  electrons  need  a  finite  time  to  adjust  their 
velocities  to  the  changes  in  field,  electron  transport 
is  directly  affected  by  the  propagating  wave.  In 
such  cases,  the  quasi-static  semiconductor  device 
models  fail  to  represent  accurately  the  exact  device 
response.  In  addition,  the  electrodes  extending 
along  the  device  width  behave  like  transmission 
lines  with  nonlinear  characteristics.  These  facts  call 
for  the  necessity  of  incorporating  wave  effects  in 
a  three-dimensional  model.  This  goal  can  be 
achieved  by  taking  full  account  of  the  varying 
fields  inside  the  device.  The  acceptable  method  for 
representing  these  various  forces  is  to  combine  an 
electromagnetic  model  with  a  semiconductor  de¬ 
vice  model  which  leads  to  the  Combined  Electro¬ 
magnetic  and  Solid-State  (CESS)  simulator  [4]. 

On  the  other  hand,  the  increasing  demand  of 
processing  and  transmitting  more  informations  at  a 
faster  rate,  drives  the  analog  and  digital  electronic 
systems  to  operate  at  higher  clock  speeds.  At  the 
same  time,  to  curtail  the  production  cost,  the 
manufacturers  are  more  inclined  towards  heavily 
densed  integrated  circuits.  In  these  high  density 
integrated  circuits,  there  are  many  closely  spaced 
active  and  passive  devices.  As  a  result,  there  are 
some  detrimental  effects  on  the  circuit  performance 
at  high  frequencies  due  to  crosstalk  caused  by 
coupling,  surface  waves  and  radiation  effects.  In 
such  cases,  the  circuit  modeling  issue  becomes  more 


intensive.  The  circuit  design  should  be  based  on 
advanced  global  model  which  takes  the  electro¬ 
magnetic  wave  effects  into  consideration. 

The  issues  like  device-wave  interaction,  electro¬ 
magnetic  coupling,  discontinuity  problem,  linear 
and  nonlinear  behavior  of  passive  and  active 
devices,  and  EM  radiation  effects  are  addressed 
in  the  global  modeling.  The  computer  memory 
requirement  as  well  as  the  simulation  time  is 
reduced  by  using  a  hybridization  approach  in 
global  modeling.  The  amplifier  is  divided  into 
three  regions,  preserving  the  physical  character¬ 
istics  of  the  amplifier  circuit  by  taking  the 
reflections  at  the  breaking  points  into  considera¬ 
tion.  The  full-wave  analysis  of  each  region  is 
performed  individually  and  coupled  to  the  next 
stage  properly  with  all  the  required  informations 
from  the  preceding  stage.  This  technique  enables 
one  to  use  large  space  step,  and  hence,  large  time 
step  in  matching  networks. 

2.  NUMERICAL  MODEL 

The  CESS  simulator  is  a  physically  based  model 
which  takes  care  of  nonisothermal  transport  and 
nonstationary  electron  dynamics  as  well  as  elec¬ 
tromagnetic  wave  propagation  effects.  This  model 
couples  the  hydrodynamic  model  to  a  3D  time- 
domain  solution  of  Maxwell’s  equations.  The 
hydrodynamic  model  is  based  on  the  moments  of 
the  Boltzmann’s  transport  equation  obtained  by 
integration  over  the  momentum  space.  The  elec¬ 
tromagnetic  wave  propagation  effects  can  be 
completely  characterized  by  solving  Maxwell’s 
equations.  These  equations  are  first-order  linearly 
coupled  differential  equations  relating  the  field 
vectors,  current  densities  and  charge  densities  at 
any  point  in  space  at  any  time.  The  coupling 
between  the  two  models  is  established  by  using  the 
fields  obtained  from  the  solution  of  Maxwell’s 
equation  in  the  semiconductor  model  to  calculate 
the  current  densities  inside  the  device.  These 
current  densities  are  used  to  update  the  electric 
and  the  magnetic  fields  using  Maxwell’s  equations 
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with  an  applied  high  frequency  sinusoidal  excita¬ 
tion.  The  initialization  is  provided  by  solving  the 
hydrodynamic  model  for  the  dc  charges  and 
currents  in  response  to  a  specified  dc  operating 
point.  In  this  manner,  the  coupling  between  the 
two  models  results  in  the  overall  high  frequency 
characteristics  of  the  semiconductor  devices.  The 
details  of  the  mathematical  representation  and  the 
coupling  procedure  can  be  found  in  [4].  The  finite- 
difference  time-domain  scheme  is  used  in  semi¬ 
conductor  device  discretization. 


3.  RESULTS  AND  DISCUSSION 

The  MESFET  (Fig.  1)  used  in  this  work  have  the 
following  parameters.  Gate-source  spacing  = 
0.4  pm,  gate-drain  spacing  =  0.56  pm,  gate 
length  =  0.24  pm,  undoped  GaAs  thickness  = 
0.3  pm,  active  layer  thickness  =  0.1  pm,  gate 
width  =  250  pm.  In  order  to  validate  the  CESS 
simulator,  a  MODFET  structure  similar  to  Shawki 
et  al.  [5]  is  simulated  to  compare  the  performances. 
The  transconductances  are  compared  with/without 
taking  the  traps  into  account.  They  exhibit  reason¬ 
able  agreement  with  each  other  in  Figure  2. 

To  demonstrate  the  electromagnetic-wave  pro¬ 
pagation  effects  for  MESFET,  a  sinusoidal  excita¬ 
tion  of  peak  0. 1 V  and  frequency  80  GHz  is  applied 
between  the  gate  and  the  source  electrodes.  The 
excitation  is  applied  as  a  plane  source  at  z  =  0,  as 
shown  in  Figure  1.  The  CESS  model  is  then  solved 
for  a  few  rf  cycles  to  avoid  the  effects  of  the 


FIGURE  2  Comparison  of  transconductance  of  MODFET 
with  Ref.  [5]  with  or  without  including  the  traps. 

transients  on  the  ac  solution.  The  output  is 
obtained  across  the  drain  and  the  source  at  several 
points  along  the  device  width  in  the  z-direction. 
The  output  voltage  wave,  as  shown  in  Figure  3, 
first  decreases  and  then  increases  along  the  device 
width.  Early  in  the  simulation,  the  electronic  effect 
is  not  present  and  the  wave  amplitude  decreases 
along  the  device  width.  Later,  as  more  and  more 
electromagnetic  energy  is  propagated  along  the 
device  width,  the  wave  energy  builds  up,  and  the 
wave  amplitude  increases.  This  figure  clearly 


FIGURE  3  The  electromagnetic  wave  propagation  effects  on 
output  voltages  of  MESFET  for  different  device  widths. 
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demonstrates  the  direct  relationship  between  the 
device  gain  characteristics  and  the  electromagnetic 
wave  propagations. 

The  advantage  of  using  the  electromagnetic 
model  in  device  simulation  is  demonstrated  in 
Figure  4.  MESFET  is  simulated  using  the  quasi¬ 
static  model  as  well  as  the  CESS  simulator.  In 
quasi-static  model,  Poisson’s  equation  is  solved  to 
get  the  electric  fields.  In  electromagnetic  model, 
Maxwell’s  equations  are  solved  to  obtain  the 
electric  and  the  magnetic  fields.  In  Figure  4,  the 
output  voltage  wave  monotonously  increases 
along  the  device  width  in  quasi-static  model.  On 
the  other  hand,  in  electromagnetic  model,  the 
output  voltage  wave  nonlinearly  increases  with  the 
device  width.  This  phenomenon  is  expected  due  to 
the  device-EM  wave  interaction.  The  exchange  of 
energy  between  the  electrons  and  the  electromag¬ 
netic  wave  takes  place  along  the  device  width.  This 
behavior  is  absent  when  the  output  is  obtained 
from  the  quasi-static  analysis.  This  figure  strongly 
supports  the  use  of  the  electromagnetic  model  for 
device  simulation  at  high  frequencies. 

To  demonstrate  the  large  signal  potential  of  the 
CESS  simulator,  the  gain  of  a  0.5  pm  x  1000  pm 
MESFET  is  calculated  for  a  small  signal  of  0.1  V 


FIGURE  4  The  comparison  of  output  voltage  variations  with 
device-width  obtained  from  the  electromagnetic  model  and  the 
quasi-static  model  for  MESFET. 


and  a  large  signal  of  0.3  V.  As  shown  in  Figure  5, 
the  gain  becomes  lower  as  the  amplitude  increases, 
which  is  expected.  The  strength  of  this  approach  is 
not  in  simply  confirming  that  larger  amplitudes 
reduce  the  gain,  but  in  estimating  the  reduction 
directly,  using  the  physical  model. 

The  potential  of  the  CESS  model  is  further 
investigated  by  simulating  an  amplifier  circuit  with 
input  and  output  matching  networks  as  shown  in 
Figure  6.  The  physical  simulation  of  the  entire 
amplifier  is  performed  using  a  global  modeling 
technique  [6].  The  optimized  transistor  parameters 
(in  terms  of  maximum  gain)  used  in  the  amplifier 


Frequency  (GHz) 

FIGURE  5  The  comparison  of  gain  characteristics  of 
MESFET  at  small  and  large  signal  operations. 


FIGURE  6  GaAs  transistor  amplifier  with  input  and  output 
matching  networks. 
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are:  gate  length  =  0.22  pm,  aspect  ratio  =  2.5  and 
the  active  layer  doping  =  2.2 xl017/cm3.  The  am¬ 
plifier  is  designed  for  40  GHz  frequency.  The 
entire  amplifier  is  simulated  by  applying  a  small 
signal  of  amplitude  0.1  V  and  large  signal  of 
amplitudes  0.3  and  1.0  V  at  different  frequencies. 
In  Figure  7,  S2 1  is  presented  for  small  and  large 
signals  at  different  frequencies.  At  the  design 
frequency  of  40  GHz,  the  gain  is  7.38  dB  for  small 
signal  and  6.44  and  5.15  dB  for  large  signals  of  0.3 
and  1.0  V,  respectively.  Thus  it  is  observed  that  for 
the  same  design  the  gain  drops  at  large  signals.  In 
small  signal,  the  gain  drops  slightly  as  the 
frequency  is  shifted  away  from  the  40  GHz  point. 
On  the  other  hand,  in  large  signal,  the  gain 
reduction  at  frequencies  other  than  design  fre¬ 
quency  is  higher.  Nonlinearity  in  the  device 
behavior  is  evident  from  this  figure. 

Once  the  large  signal  response  of  the  amplifier  is 
obtained,  it  is  interesting  to  study  its  frequency 
content  and  to  identify  the  harmonics.  The  output 
wave  contains  a  significant  amount  of  third  and 
fifth  harmonic  components  as  shown  in  Figure  8. 
For  the  large  signal  input  of  0.3  V,  the  output 
power  contains  4.8%,  11.6%  and  13%  of  the 


FIGURE  7  The  dependence  of  scattering  parameter  S2 1  on 
frequency  at  small  and  large  signals  for  the  amplifier  with  the 
optimized  transistor. 
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FIGURE  8  The  intermodulation  distortion  at  the  third  and 
the  fifth  harmonic  components  for  the  large  signal  of  amplitude 
0.3  at  30,  40  and  50  GHz. 


fundamental  at  30,  40  and  50  GHz,  respectively,  at 
the  third  harmonic.  The  fifth  harmonic  contains 
6%,  0.7%  and  0.3%  of  the  fundamental  at  30,  40 
and  50  GHz,  respectively.  This  shows  that  a 
considerable  amount  of  power  is  transferred  at 
the  harmonic  components.  Thus  the  global  model 
is  able  to  predict  the  nonlinearity  of  the  device 
behavior  and  show  the  different  harmonic  compo¬ 
nents  generated  at  the  amplifier  output  due  to 
nonlinearity. 


4.  CONCLUSIONS 

Advantages  of  semiconductor  device  and  circuit 
simulator  combining  an  electromagnetic  and  an 
electron  transport  models  are  presented.  The 
simulation  confirms  that  a  significant  device-wave 
interaction  takes  place  in  semiconductor  devices  at 
high  frequencies.  The  use  of  the  electromagnetic 
model  over  the  conventional  quasi-static  model 
provides  the  actual  device  response  at  high 
frequencies.  It  also  shows  the  nonlinear  energy 
build-up  along  the  device  width  whereas  the  quasi¬ 
static  model  provides  a  linear  increase  of  energy. 
An  approach  towards  global  modeling  of  milli- 
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meter-wave  circuits  is  also  presented  in  this  paper. 
The  global  model  is  able  to  characterize  the 
electromagnetic  coupling,  device-EM  wave  inter¬ 
action  and  the  EM  radiation  effects  of  the  very 
closely  spaced  integrated  circuit  amplifier.  The 
global  modeling  technique  is  capable  of  represent¬ 
ing  the  nonlinearity  and  the  harmonic  distortion  of 
the  amplifier  circuit.  More  effects  will  be  added  in 
the  future  including  thermal  and  packaging  effects. 
By  incorporating  all  these  effects  in  the  circuit 
simulation,  a  milestone  will  be  reached  towards  the 
comprehensive  global  modeling. 
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In  recent  years,  artificially  layered  microstructure  have  been  considered  as  candidates 
for  better  thermoelectrics.  In  this  work  we  examine  transport  properties  of  the  type-II 
broken-gap  InAs/GaSb  superlattice.  We  use  the  effective  bond  orbital  model  for  an 
accurate  description  of  the  band  structures.  Theoretical  results  of  thermoelectric 
transport  coefficients  and  the  dimensionless  figure  of  merit  for  an  (8,8)-InAs/GaSb 
type-II  superlattice  are  presented. 
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1.  INTRODUCTION 

From  about  1940  to  1965,  many  scientists  made 
tremendous  effort  looking  for  superior  thermo¬ 
electric  materials.  The  advantage  of  using  these 
solid  state  devices  are  compactness,  quietness  (no 
moving  parts),  freedom  from  corrosion,  localized 
heating  or  cooling,  and  reliability.  The  effort 
dwindled  eventually  because  performance  of  most 
thermoelectric  materials  found  at  that  time  were 
too  poor  to  be  used  for  practical  commercial 
applications.  Recently,  there  is  a  renewed  interest 
in  this  area  [1,  2],  driven  by  the  following  reasons: 
First,  thermoelectric  technology  is  environmen¬ 
tally  cleaner  than  traditional  compressor-based 
refrigeration  technology  since  it  does  not  use 
chlorofluorocarbons.  Second,  several  new  materi¬ 


als  were  identified  as  potential  candidates  for 
better  thermoelectrics,  including  the  filled  skutter- 
udite  antimonides  [3]  and  PbTe/Pbi_xEuxTe  mul¬ 
tiple-quantum-well  structures  [4].  Among  the  new 
materials,  superlattices  attracted  many  scientists’ 
attention  [5,  6,  7,  8,  9].  The  interest  can  be  traced 
back  to  the  quantitative  results  first  obtained  by 
Hicks  and  Dresselhaus  [5]  where  huge  enhance¬ 
ment  of  thermoelectric  properties  was  predicted 
for  superlattice  structures. 

In  the  present  work,  we  focus  our  interest  on  the 
type-II  broken-gap  InAs/GaSb  superlattices. 
Superlattices  consisting  of  combinations  of  III- 
Y  semiconductors  with  type-II  band  alignments 
are  of  interest  for  infrared  applications,  including 
IR  detectors  [10]  and  laser  diodes  [11,  12].  This  is 
because  their  energy  gaps  can  be  made  smaller 
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than  their  constituents.  However,  for  most  of  these 
applications,  cooled  operation  are  desirable  for 
good  performance.  If  reasonable  thermoelectric 
properties  of  such  superlattice  structure  could  be 
obtained,  directly  integration  of  thermoelectric 
cooling  devices  with  a  set  of  IR  detectors  or  laser 
diodes  may  offer  some  advantage.  Our  aim  here  is 
to  present  the  early  results  of  our  ongoing 
theoretical  effort  to  identify  and  characterize  the 
thermoelectric  properties  of  the  type-II  broken- 
gap  InAs/GaSb  superlattices.  In  Sec.  II,  the 
theoretical  framework  for  calculating  electronic 
contributions  to  the  thermoelectric  properties  of 
superlattices  using  realistic  band  structure  models 
is  presented.  Our  results  for  an  (8, 8)-InAs/GaSb 
superlattice  is  presented  in  Sec.  III. 


2.  MODEL 

The  dimensionless  figure  of  merit  for  a  thermo¬ 
electric  material  is  given  by  [14,  15] 


The  low-field  transport  coefficients  for  thermo¬ 
electric  materials  are  defined  by 

J  =  aE  -aSVT  (3) 

JQ  =  TaS  E  -  ko  VT  (4) 

where  J  is  the  electric  current  density,  JQ  the 
thermal  current  density,  E  the  electric  field,  T 
the  temperature,  a  the  electrical  conductivity,  S 
the  thermopower,  and  k0  the  thermal  conductivity 
at  zero  electric  field.  In  general,  a,  S ,  and  k0  are 
3x3  matrices  in  the  Cartesian  coordinates.  Note 
that  the  value  of  k  needed  for  computing  Z  is  the 
total  heat  flow  for  zero  electrical  current,  so  it  is 
given  by 

k  =  Ko  —  TaS2  +  ki  (5) 

where  is  the  lattice  contribution  to  the  thermal 
conductivity. 

Using  the  relaxation  time  approximation  to  the 
Boltzmann  equation,  the  transport  coefficients  are 
given  by  [16] 


where  S  is  the  thermopower  (thermoelectric  power 
or  Seebeck  coefficient),  a  is  the  electrical  con¬ 
ductivity,  k  is  the  total  thermal  conductivity 
(k  =  k\  +  Ke,  the  lattice  and  electronic  contribu¬ 
tions,  respectively),  and  T  is  the  temperature.  The 
maximum  Coefficient  Of  Performance  (COP)  is 
directly  related  to  the  dimensionless  figure  of  merit 
of  the  material  by  [14,  15] 


<'=‘1£>(-i)S(t)'  (6) 
Tas=~eiy£{~W)xi£)i£~^ (?) 
TKc=ii<k(~^) 2(£)(£_m)2’  w 

where  fx  is  the  chemical  potential,  e  the  electron 
charge,  and  /0  the  Fermi-Dirac  distribution  func¬ 
tion.  Here  2(e)  is  given  by: 


n  +zr-  7h/rc 
VTTzT+i 


where  Tc,  Th  are  the  temperatures  at  the  cold  and 
hot  junctions,  respectively,  and  T,  which  is  equal 
to  (7j,  +  Tc)/2,  is  the  mean  absolute  temperature, 
for  a  single-stage  thermoelectric  cooler.  As  ZT 
goes  to  infinity,  the  COP  goes  to  the  thermo¬ 
dynamic  limit  of  Carnot  efficiency. 


2(e)  -  T(e)£r(k)r(k)6(e  -  e(k))  (9) 

k 

where  y(k)  =  Vks(k)/H  is  the  semi-classical  carrier 
velocity,  e(k)  is  the  dispersion  relation  for  the 
carriers,  and  r(e)  is  the  energy-dependent  relaxa¬ 
tion  time  tensor,  taken  to  be  ^-independent.  The 
summation  is  over  the  first  Brillouin  zone. 
Equations  (6),  (7),  and  (8)  are  the  basic  results  of 
the  theory  of  electronic  contributions  to  the 
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thermoelectric  effects.  It  should  be  noted  that 
phonon  contributions  are  ignored  here. 

The  theoretical  scheme  we  used  for  calculating 
of  the  electronic  structure  of  InAs/GaSb  super¬ 
lattices  is  the  Effective  Bond-Orbital  Model 
(EBOM)  [13].  This  method  combines  the  virtues 
of  the  kp  and  the  tight-binding  methods.  The 
basic  idea  is  to  use  a  minimum  number  of  bond 
orbitals  to  describe,  as  accurately  as  possible,  the 
most  relevant  portion  of  the  bulk  band  structures, 
and  then  use  them  in  a  supercell  calculation  to 
obtain  superlattice  band  structures.  In  our  case, 
eight  bond  orbitals  per  unit  cell,  including  the  s - 
like  bond  orbitals  with  spin  up  and  spin  down, 
four  bond  orbitals  each  with  total  angular 
momentum  J  =  3/2  (made  of  /?-like  states  coupled 
with  spin)  and  two  additional  bond  orbitals  each 
with  total  angular  momentum  /=  1/2  (also  made 
of  /?-like  states  coupled  with  spin),  are  used.  This  is 
because  the  superlattice  states  of  interest  contain 
admixture  of  both  valence-band  and  conduction- 
band  characteristics.  We  assume  that  all  the  bond 
orbitals  are  sufficiently  localized  so  that  the 
interaction  between  orbitals  separated  farther  than 
the  nearest  neighbor  distance  can  be  ignored.  All 
nonvanishing  interaction  parameters  can  then  be 
directly  related  to  the  effective  masses  or  other 
band  parameters  of  the  k  *  p  perturbation  theory. 
Based  on  this  model,  an  accurate  band  structure 
could  be  obtained  for  values  of  k  near  the  zone 
center. 

Our  strategy  for  computing  thermoelectric 
properties  is  as  follows.  First,  we  calculate  the 
superlattice  band  structures  by  using  SOBO 
model.  Then,  we  perform  full  Brillouin  zone 
integration  to  obtain  2  (e).  Finally,  we  use  Equa¬ 
tions  (6),  (7)  and  (8)  to  compute  the  transport 
coefficients. 


3.  RESULTS  AND  DISCUSSION 

Figure  1  shows  our  calculated  band  structure  for 
an  InAs/GaSb  superlattices  eight  monolayers  of 
InAs  and  eight  monolayeres  of  GaSb  per  period. 


Band  Structure 


(8,8)  InAs/GaSb  Superlattice 


FIGURE  1  Illustration  of  band  structure  of  (4, 4)  InAs/GaSb 
superlattices.  Note  that  in  our  calculation  the  strain  effect  is 
included. 


The  conduction  band  minimum  of  bulk  InAs  is 
taken  to  be  at  0  eV.  Experimental  band  offset 
value  [17]  between  InAs  and  GaSb  then  puts  the 
valence  band  top  of  GaSb  at  approximately  200 
meV  above  the  conduction  band  minimum  of  the 
InAs  conduction  band  edge.  In  the  figure,  super¬ 
lattice  band  structure  perpendicular  and  parallel  to 
growth  directions  are  shown.  The  valence  subband 
maximum  is  found  at  0.086  eV,  while  the  conduc¬ 
tion  subband  minimum  at  0.303  eV,  yielding  a 
superlattice  bandgap  of  0.217  eV.  Note  that  the 
conduction  subband  structure  along  the  super¬ 
lattice  growth  direction  is  still  very  dispersive.  This 
is  due  the  broken-gap  band  alignment,  the  small 
InAs  conduction  band  effective  mass,  and  the 
relatively  short  period. 

To  compute  thermal  electric  properties,  we  need 
to  know  relaxation  times.  Lacking  specific  knowl¬ 
edge  of  relaxation  times  in  superlattice  structures, 
we  made  an  estimate  from  the  mobility  data 
for  bulk  InAs  and  GaSb,  and  assume  a  value  of 
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r(e)  «  10_13sec.  All  of  our  calculations  assume  a 
temperature  of  300  K.  In  Figure  2,  we  plot  the 
thermoelectric  transport  coefficients  as  functions 
of  the  chemical  potential.  Due  to  the  fact  that 
transport  along  the  growth  (transverse)  direction  is 
impeded  by  scattering  from  the  superlattice  inter¬ 
faces,  the  transverse  components  (zz)  of  electrical 
conductivity  (cr)  and  electronic  contribution  to 
thermal  conductivity  (ne)  are  always  smaller  than 
the  parallel  (xx)  components.  The  transverse 
components  of  all  three  transport  coefficients  show 
strong  oscillations  as  functions  of  chemical  poten¬ 
tial.  This  can  be  understood  by  examining  the 
superlattice  minibands  along  the  transverse  ([001]) 
direction.  We  can  easily  see  that  the  oscillations 
track  the  miniband  edges. 

In  order  to  calculate  the  thermoelectric  figure  of 
merit  for  this  structure  we  need  the  lattice 
contribution  to  the  thermal  conductivity.  From 
the  experimental  data  of  bulk  InAs  and  GaSb,  we 
take  the  lattice  thermal  conductivity  of  superlattice 
as  the  average  of  two  bulk  values,  which  is 
approximately  35  W/K  m.  This  procedure  prob¬ 
ably  overestimates  the  values  of  lattice  conductiv¬ 


ity  since  we  would  expect  interface  scattering  to 
lower  thermal  conductivities.  In  Figure  3,  we  show 
the  dimensionless  figure  of  merit  ZT  v s.  the 
chemical  potential.  For  parallel  transport,  the 
optimum  value  occurs  when  chemical  potential  is 
about  0  eV.  Its  value  is  about  0.033  which  is  not 
better  than  the  bulk  value.  This  result  is  opposite 
to  that  of  Hicks’  calculation  [5]  where  ZT  for  a 
type-I  superlattices  better  than  bulk  materials.  The 
difference  is  probably  due  to  the  much  stronger 
inter-well  coupling  in  the  type-II  broken  gap 
superlattices. 

While  our  calculations  have  carefully  treated 
band  structure  effects,  the  models  we  used  to 
estimate  relaxation  times  and  lattice  thermal 
conductivities  have  been  rather  crude.  In  particu¬ 
lar,  it  is  likely  that  we  considerably  over-estimated 
the  superlattice  thermal  conductivity  [2].  These 
issues  must  be  addressed  before  we  could  obtain  a 
more  realistic  description  of  superlattice  thermo¬ 
electric  properties.  In  summary,  we  have  calcu¬ 
lated  the  thermoelectric  properties,  including 
conductivity,  electronic  contribution  to  thermal 
conductivity,  and  thermal  power,  for  a  type-II 


Transport  Coefficients 


FIGURE  2  Shown  are  the  conductivity,  electronic  contribu¬ 
tion  to  the  thermal  conductivity,  and  thermopower  vs.  the 
chemical  potential  at  T=  300  K. 


Figure  of  Merit 


FIGURE  3  ZT  vs.  chemical  potential  for  the  (8, 8)-InAs/ 
GaSb  superlattices. 
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broken-gap  InAs/GaSb  superlattice  using  realistic 
band  structure  models.  The  computational  tools 
developed  form  a  basis  for  further  explorations  of 
superlattice  thermoelectric  properties. 
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We  generalize  the  Multiband  Quantum  Transmitting  Boundary  Method  (MQTBM)  for 
computing  transmission  coefficients  in  heterostructure  to  tight-binding-like  band 
structure  models  with  non-orthogonal  basis  and  multiple  neighbor  interactions.  We 
implement  this  method  based  on  the  newly  developed  planar-basis  pseudopotential 
method  which  uses  the  generalized  planar  Wannier  function  basis.  We  demonstrate  the 
method  by  computing  transmission  coefficients  for  a  GaAs/AlAs  double  barrier 
structure. 


Keywords:  Resonant  tunneling,  tight  binding,  non-orthogonal  basis,  transmission  coefficient 


1.  INTRODUCTION 

Since  Chang,  Esaki  and  Tsu  first  observed  resonant 

tunneling  effect  in  heterostructures  in  1974  [1], 
many  theoretical  methods  have  been  developed  to 
model  the  physics  of  resonant  tunneling  phenom¬ 
enon  in  semiconductor  heterostructures.  The  most 
commonly  used  formalism  for  computing  tunnel¬ 
ing  probabilities  in  these  structures  is  the  transfer- 
matrix  method.  Unfortunately,  this  method  is 
numerically  unstable,  and  especially  so  for  multi¬ 
band  models.  One  solution  which  circumvents  the 
instability  problem  is  the  Multiband  Quantum 
Transmitting  Boundary  Method  (MQTBM)  [2]. 


The  MQTBM  is  a  multiband  realization  of  the 
Quantum  Transmitting  Boundary  Method  origin¬ 
ally  developed  by  Lent  and  Kirkner  [3]  for  treating 
quantum  transport  in  nanostructures.  Thus  far, 
MQTBM  has  been  implemented  for  several 
realistic,  empirical,  multiband  models,  including 
the  tight-binding  model,  the  effective  bond  orbital 
model  and  the  k  •  p  method  [4]. 

Recently,  Chang  has  developed  the  planar-basis 
pseudopotential  band  structure  model  for  treating 
electronic  properties  of  surfaces  and  heterostruc¬ 
tures  [5,  6].  For  treating  structures  with  layer-like 
local  geometry,  this  model  is  considerably  more 
efficient  than  plane-wave  based  pseudopotential 


*  Corresponding  author.  Tel.:  +886.3.574.2518,  Fax:  +886.3.572.3052.  E-mail:  dzt@phys.nthu.edu.tw. 


507 


508 


G.-C.  LIANG  et  al 


methods,  and  leads  to  better  analysis  of  surface 
and  interface  properties.  Furthermore,  it  can  be 
constructed  directly  from  first  principles  and  thus 
offers  the  possibility  for  more  physically  accurate 
descriptions  than  could  be  obtained  with  empirical 
band  structure  models.  The  method  has  been 
successfully  applied  to  the  studies  of  electronic  and 
optical  properties  of  surface  and  interfaces.  We 
implement  MQTBM  for  this  new  band  structure 
model  so  that  it  can  be  used  to  treat  quantum 
transport  problems  as  well.  The  present  work 
differs  from  MQTBM  implementations  in  the  past 
in  that  the  basis  functions  used  are  non-orthogo- 
nal,  and  thus  introduces  some  new  complications. 

2.  METHOD 

A.  Planar-Basis  Pseudopotential  Method 
and  Planar  Wannier  Functions 

We  briefly  introduce  the  planar-basis  pseudopo¬ 
tential  method  and  planar  Wannier  functions,  and 
refer  the  readers  to  the  work  by  Chang  [5, 6]  for 
more  details.  In  the  planar-basis  pseudopotential 
method,  planar-basis  functions  consisting  of  2D 
plane  waves  in  the  x-y  plane  and  ID  gaussian 
functions  in  the  z  direction  are  used  in  hetero¬ 
structure  electronic  calculations  [5,  6].  The  method 
can  be  made  formally  equivalent  to  either  empiri¬ 
cal  or  self-consistent  first  principles  pseudopoten¬ 
tial  methods.  Using  these  basis  functions,  we  can 
find  Bloch  states  associated  with  band  n  and  wave 
vector  k ,  '^n.kiy)  and  use  them  in  turn  to  construct 
the  planar  Wannier  functions.  Assuming  the 
heterostructure  growth  direction  is  along  the  z 
axis,  a  planar  Wannier  function  is  a  translationally 
invariant  function  in  the  x-y  plane  characterized 
by  an  in-plane  wavevector  k\\,  and  centered  along 
the  growth  direction  at  Rz  : 

{r\Wnh{Rz))  =  Jdkze-ik^nM{r).  (1) 

For  a  fixed  value  of  k\\9  the  set  of  Wannier 
functions  with  same  band  index  n  and  centered  at 


different  Rz  may  be  thought  of  as  a  1-band  tight- 
binding  basis  which  can  be  used  to  calculated  the 
bulk  band  structure  for  the  nth  band.  However, 
the  interaction  among  those  functions  will  in 
principle  extend  to  large  distances,  which  makes 
computations  inefficient.  The  Generalized  Planar 
Wannier  Functions  (GPWF)  can  be  used  to  solve 
this  problem.  The  generalized  planar  Wannier 
functions  are  proper  linear  combinations  of 
WnM(Rz)  for  the  interested  bands  such  that  they 
are  more  localized  along  the  z  direction.  In  this 
new  sub-basis,  the  interaction  parameters  can  be 
cut  off  at  shorter  distances,  but  at  the  cost  of 
leaving  the  Hamiltonian  in  the  GPWF  sub-basis 
non-diagonal  in  band  index.  In  addition,  depend¬ 
ing  on  the  number  of  neighboring  interactions 
kept,  the  GPWF  sub-basis  could  be  either 
orthogonal  or  non-orthogonal. 

For  the  generalized  planar  Wannier  functions, 
the  elements  of  the  Hamiltonian  matrix  are  given 
by, 

HmA  k)  =  (2) 

Rz 

where  Wfm,k^  is  a  generalized  planar  Wannier 
function.  By  storing  the  interaction  terms 
(w'm,k{ft)\H\w'm\k^Rz))  up  to  the  desired  num- 
ber  of  neighbors  (two  for  non-orthogonal,  and 
four  for  orthogonal  basis)  for  each  fixed  value  of 
&||,  we  can  converted  the  problem  into  a  ID 
multiband  tight-binding  model. 

Let  d  be  monolayer  separation  along  the  growth 
direction.  We  can  then  write  Rz=  ad,  where  a  is  a 
integer  layer  label.  We  can  denote  the  generalized 
Wannier  planar  functions  with  the  following  tight- 
binding-like  simplified  notation  : 

k.k|>  =  l  W'm<h{R2  =  ad))  (3) 

where  m=  1,  2,...,  M  labels  the  basis  functions 
within  a  monolayer  ( M=  16  in  our  case).  We  then 
appeal  to  MQTBM  for  transmission  coefficient 
calculations. 
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B.  Transmission  Coefficient  Calculations 
in  Non-Orthogonal  Basis 

The  multiband  quantum  transmitting  boundary 
method  is  a  numerically  stable  and  efficient 
formulation  for  calculating  heterostructure  trans¬ 
mission  coefficients  using  realistic  multiband  tight- 
binding  band  structure  model.  As  seen  in  the 
previous  section,  the  Generalized  Planar  Wannier 
Functions  (GPWF)  may  be  viewed  as  a  non- 
orthogonal  tight-binding  basis  with  second-neigh¬ 
bor  interactions.  Previously,  MQTBM  has  been 
implemented  for  several  nearest-neighbor  tight- 
binding  models  with  orthogonal  bases.  The  gen¬ 
eralization  to  orthogonal  basis  with  multiple 
neighbor  interactions  is  straight-forward.  How¬ 
ever,  a  somewhat  different  formulation  is  required 
for  non-orthogonal  bases.  In  this  work,  we  will 
introduce  a  more  general  version  of  MQTBM 
capable  of  handling  multiple-neighbor  interactions 
in  a  non-orthogonal  basis.  We  consider  a  hetero¬ 
structure  as  consisting  of  an  active  region  sand¬ 
wiched  between  two  semi-infinite  flat-band 
electrode  regions.  The  computational  domain  con¬ 
sists  of  a  sequence  of  N  monolayers  parallel  to 
heterointerfaces.  It  includes  all  of  the  active  region, 
and  extends  out  from  both  sides  of  the  active  region 
to  include  2Ni  layers  of  each  of  the  flat-band 
electrodes;  Nr  is  the  number  of  neighboring  inter¬ 
actions  for  the  particular  tight-binding  model  used. 
As  will  be  been,  these  inclusions  are  sufficient  to 
allow  us  to  fully  incorporate  the  scattering  bound¬ 
ary  conditions  within  the  finite  computational 
domain. 

We  label  the  layers  in  the  computational  domain 
by  (7=1,  2, . . . ,  N.  For  a  second-neighbor  model, 
for  example,  a=  1,  2,  3,  4  will  be  in  the  left 
electfode,  a  =  N- 3,  N- 2,  N- 1,  N  the  right  and 
the  rest  are  the  active  layer.  From  this  point  on  we 
will  specifically  examine  the  case  of  a  second- 
neighbor  model  for  simplicity.  However,  all 
derivations  generalize  trivially  to  models  with 
larger  interaction  range. 

A  wave  function  in  the  tight-binding  basis  may 
be  written  as 


l*>  =  £  CCTm  | am,  *|,  >,  (4) 

am 

where,  again,  k\\  is  the  planar  orbital  in-plane  wave 
vector,  a  the  layer  label  and  m  the  orbital  label. 

The  Schrodinger  equation  (iY-^)|^)=0  in 
this  basis  may  be  written  as 

2Cr— 2  +  HGyG-\Ca„\  +  HG,GCG 

+  JV+lCx+1  +  Ha,a+2C<r+2  — 

where  CG  is  a  vector  of  length  M, 


(  C<j\  \ 


\CaM  / 


and  HG}Gf  are  M  x  M  matrices, 

&a}<7'  =  —  ESG}(Tf.  (7) 

The  elements  of  the  Hamiltonian  matrix  H  and 
overlap  matrix  S  are  given,  respectively,  by  : 

iam’  *|l H\°'m\  *|| >,  (8) 

and 

(s°y)m,m-=  (am’  *n>-  (9) 

For  convenience,  we  do  not  explicitly  show  the 
dependence  on  Ajj  and  E  in  our  notation. 

The  boundary  conditions  in  scattering  problems 
are  usually  described  in  terms  of  Bloch  states,  i.e., 
as  amplitudes  of  incoming  and  outgoing  plane 
waves.  To  translate  them  into  the  tight-binding 
basis,  we  make  use  of  the  transfer  matrix  and 
Bloch’s  theorem.  Together,  they  provide  a  connec¬ 
tion  between  the  plane-wave  and  the  local-orbital 
descriptions  through  the  concept  of  complex  band 
structures  [7]. 
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We  can  write  Eq.  5  in  the  transfer-matrix  form, 


Now,  consider,  plane-wave  state  in  either  of  the 
flat-band  electrode  regions, 

\Mk)  =  IV’/(*n,*i))  =  (ii) 

am 

Since  | ipi(k)  is  a  Bloch  state,  by  Bloch’s  theorem, 
the  tight-binding  coefficients  must  obey  the  rela¬ 
tion 

Ba  =  e^dBa- 1,  (12) 

where  d  is  the  distance  between  monolayers,  and 
A;±is  the  component  of  the  crystal  momentum 
along  the  growth  direction.  Then,  together  with 
Eq.  (10),  we  can  formulate  the  following  eigenva¬ 
lue  problem: 

-H-)a_2Ha,a-X  -H~y2H^+l 

1  0  0 

0  1  0 

0  0  1 


binding  basis  and  Bloch  (plane- wave)  states  [4]. 
We  sort  the  eigenvectors  such  that  the  first  2 M 
vectors  represent  state  which  propagate  or  decay 
to  the  right,  and  the  remainder  2 M  to  the  left.  The 
sorted  eigenvectors  are  then  grouped  in  a  Ax  AM 
matrix  D . 

Assume  that  we  have  a  known  coefficient  set  of 
incoming  plane  wave,  /,  and  unknown  coefficient 
sets  of  reflected  and  transmitted  parts,  r  and  t, 
where  /,  r  and  t  are  the  column  vectors  of  the 
length  2M.  Then  we  can  find  the  relations: 


Recall  the  implicit  dependence  of  Ha^>  on  E. 
Solving  the  above  yields  a  set  of  eigenvalues 
{< elkxd },  from  which  a  corresponding  set  of  com¬ 
plex  (£±)  values  are  readily  calculated.  E(k±)  is  the 
bulk  complex  band  structure  [7].  As  an  example, 
the  complex  band  structure  for  bulk  GaAs 
calculated  using  the  planar-basis  pseudopotential 
band  structure  model  is  shown  in  Figure  1. 

Each  of  the  AM  eigenvectors  contain  the 
coefficients  of  tight-binding  orbitals  on  the  one 
hand,  and  is  associated  with  a  particular  complex 
wave  vector  (&||,A:J_)  on  the  other.  In  this  way, 
they  provide  a  connection  between  the  local  tight- 


and 


(15) 


where  DL  and  DR  are  AM  x  AM  matrices  formed 
by  the  eigenvectors  obtained  by  solving  Eq.  (13), 
for  the  bulk  materials  making  up  the  left  and  right 
electrodes,  respectively,  and  the  Cj  is  a  vector  of 
length  2 M  constructed  by  joining  two  consecutive 
C  vectors  :  Cj  contain  Cj  and  C2 ,  Cj  contain  C3 
and  C4,  C(v/2)-i  contain  Cj\r_3  and  CN_2  and  CN/2 
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contain  CN_\  and  CN  instead.  Note  that  we  have 
divided  each  Z),y  matrix  into  2 Mx  2M sub-matrices 
for  convenience. 

Eliminating  r  and  t  from  Eqs.  (14)  and  (15),  we 
obtain 

Cl  -  d[2d^'c2  =  DLni- (16) 
and 

+  CNj2  =  0  (17) 

Combining  the  above  equations  and  Eq.  (5),  we 
obtain  MN  linear  equations  written  in  the  matrix 
form  as 


GaAs  Complex  Band  Structure 


Planar-basis  Pseudopotential  Method 


FIGURE  1  Complex  bands  of  GaAs  calculated  using  the 
planar-basis  pseudopotential  method. 
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(18) 


where  is  an  Mx  M  identity  matrix  and  the 
bracketed  terms  are  2 Mx  2 M  sub-matrices. 
Solving  the  matrix  equation  yields  a  set  of 
coefficient,  {Ci,C2,. . CV}.  By  Eq.  (15),  the 
coefficients  of  the  transmitted  plane-wave  states 
can  be  computed  by 

i=D*~lCm  (19) 

We  can  use  t  to  compute  the  transmission 
coefficient  :where  vI(Eik\\;L)  and  Vj(E,  kfR)  are 


the  velocities  of  the  incident  and  the  transmitted 
bulk  plane-waves,  respectively. 

3.  RESULTS  AND  DISCUSSION 

We  have  successfully  applied  the  multiband 
quantum  transmitting  boundary  method  to  tight- 
binding  band  structure  models  with  nonorthogo- 
nal  basis  and  multiple-neighbor  interactions. 
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Figure  2  shows  transmission  spectra  of  two 
different  GaAs/AlAs  double  barrier  structures 
calculated  for  using  the  Generalized  Planar  Wan- 
nier  Functions  (GPWF)  band-structure  model 
with  second-neighbor  interactions.  Both  T-  point 
and  the  X-point  resonant  tunneling  are  seen  in 
Figure  2.  The  T-  and  X-point  resonances  are 
associated  with  quasibound  states  localized  in  the 
GaAs  and  the  AlAs  layers  respectively.  The  two 
structures  used  have  the  same  GaAs  layer  widths 
(16  monolayers)  but  different  AlAs  layer  widths  (4 
and  8  monolayers).  Consequently,  the  positions  of 
the  T-point  resonances  remain  the  same  for  the 
two  structures,  while  the  X-point  resonances 
showed  a  definite  position  shift. 
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In  this  paper  we  use  the  code  Exemplar  for  matching  a  hydrodynamic  ID,  time- 
dependent  simulator  and  the  transport  coefficients  obtained  by  the  Monte  Carlo 
simulator  Damocles.  This  code  is  based  on  the  Least  Square  method  and  it  does  not 
require  any  a  priori  knowledge  about  the  simulator  (analytical  form  of  the  equations 
etc.).  The  stationary  electron  flow  in  a  one  dimensional  n  + -n-n+  submicron  silicon 
diode  is  simulated. 


Keywords:  TCAD,  VLSI,  BTE,  transport  theory,  fluid  mechanics,  electronic  devices 


1.  INTRODUCTION 

Electronic  transport  in  semiconductors  can  be 
described  by  hydrodynamic  models  (hereafter 
HM),  obtained  by  taking  the  moments  of  the 
Boltzmann  transport  equation  (hereafter  BTE): 
the  resulting  mathematical  model  consists  of  an 
infinite  hierarchy  of  Partial  Differential  Equations 
expressing  balance  laws  for  the  particle  number  n , 
velocity  u,  total  energy  E ,  deviatoric  stress  tensor 
0(,y),  energy  flux  S  (or  heat  flux  h)  and  so  on, 
coupled  with  the  Poisson  equation. 

Recently  Anile  and  Muscato  [1]  presented  an 
Extended  Hydrodynamic  model  where  the  closure 
of  the  moment  hierarchy  is  obtained  by  exploiting 
the  entropy  principle:  this  system,  which  is 


hyperbolic,  consists  of  13  scalar  equations  in  the 
13  unknowns  («,£,£, 0(ij),h)  and  completely 
determines  the  description  of  the  stress  and  of 
the  heat  flux,  at  variance  with  the  other  models. 
Such  a  model  has  been  tested  successfully  with 
Monte  Carlo  (hereafter  MC)  simulations  in  Silicon 

[2] ,  The  Left  H  and  Side  of  the  balance  equations, 
called  production  terms ,  represent  the  average  rate 
of  change  of  carrier  total  energy  gw,  momentum 
gp,  energy-flux  gs  and  stress  due  to  the 
scattering  of  carriers  with  the  lattice.  Usually  they 
are  approximated  with  ad  hoc  empirical  formula 

[3]  or  as  relaxation  terms  [4]:  this  last  approxima¬ 
tion  leads  to  a  serious  inconsistency  with  one  of 
the  fundamental  principles  of  Linear  Irreversible 
Thermodynamics,  the  Onsager  Reciprocity  Princi- 
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pie.  In  order  to  tackle  this  problem,  we  expanded 
the  electron  distribution  function  in  Hermite 
polynomials  around  the  state  of  global  thermal 
equilibrium  (limiting  ourselves  to  the  first  two 
polynomials),  where  the  electrons  lie  close  to  the 
bottom  of  the  conduction  band.  By  using  a  well 
known  procedure  due  to  Grad  [5]  we  obtained  the 
following  equations: 


2w  =  - 


E-E0 


Qv  -  ( al  +  aPiy  +  a2~ 
Jo  n0 


nu 


+ 


(*S  +  K  Jr +  *>;£) 


nS 


Qs  =  (4  +  aSi^r  +  af—)kBT0nu 
V  Jo  no) 

+(^+6^+Os 


(1) 

(2) 


(3) 


where  T0  is  the  room  temperature,  £0  the  lattice 
total  energy,  n0  a  reference  impurity  concentration 
(1018cm“3).  Since  in  this  approximation  the 
distribution  function  is  quasi-isotropic  [2]  the 
deviatoric  stress  tensor  6y  is  negligible  and  by  this 
method  we  cannot  extract  the  corresponding 
production  term  (unless  we  consider  higher  order 
Hermite  polynomials). 

For  the  sake  of  simplicity  we  modeled  the  stress 
production  as  a  relaxation  term: 

Qm  =  Jf-  (4) 

•a 

We  should  emphasize  that: 

•  the  coefficients  appearing  in  Eqs.  (1-4)  are  not 
fitting  parameters  but  rather  will  be  extracted  by 
MC  data  obtained  by  the  Damocles  code  [6],  in 
the  case  of  parabolic  spherical  band  approxima¬ 
tion,  in  order  to  be  consistent  with  the  Anile  and 
Muscato  model,  obtained  under  these  restric¬ 
tions; 


•  these  coefficients  are  not  functions  of  the 
positions  in  the  device  as  some  authors  claim. 

With  such  coefficients  we  obtained  a  closed  set  of 
hydro  equations  which  has  been  solved  by  an 
adequate  numerical  scheme.  Since  the  Monte 
Carlo  method  gives  a  stochastic  solution  of  the 
BTE,  the  results  are  noisy:  how  does  the  hydro 
solution  change  for  small  variations  of  the  para¬ 
meters  rw,  ra,  a0 ,  #i,  a2 ,  b0 ,  61,  b{l  In  order  to 
answer  this  question  the  simulator  Exemplar, 
developed  by  one  of  us  [7],  has  been  used. 

The  plan  of  the  paper  is  the  following:  in  section 
2  we  discuss  the  optimization  problem;  in  section  3 
we  simulate  the  usual  n+  ~n-n+  diode  with  the 
Damocles  code  and  with  our  Extended  hydrody¬ 
namic  model.  We  discuss  the  range  of  validity  of 
the  optimization  procedure  for  various  biases  and 
conclusion  are  drawn. 

2.  THE  OPTIMIZATION  PROBLEM 

The  optimization  problem  is  well  known:  defining 
the  residual  function 

R(a)  =  ^ ~y>t{Axua )  -  yif  (5) 

*‘=1 

where  a  is  the  vector  of  the  parameters,  f(x»  a)  is 
the  analytical  description  of  the  event  computed  in 
Xi  (modeled  by  the  vector  parameter  a ),  yt  the  real 
event  value  obtained  in  xt  and  wt  an  appropriate 
weight  factor.  In  a  mathematical  language  the 
problem  of  the  least  squares  method  could  be 
expressed  in  the  form: 

min {R(a)/a  e  Rn}  (6) 

where  n  is  the  dimension  of  the  vector  a  and  a  the 
solution  of  the  system,  generally,  non-linear 

WR(a)  -  0  (7) 

and  Rn  the  n  dimensional  space  of  the  real  R. 

If  we  know  the  analytical  equation  of f(x ,  a ),  we 
could  evaluate  and  resolve  the  system  S/R{a)  =  0, 
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using  one  of  many  methods  proposed  (e.g. 
Marquardt).  However,  if  we  do  not  know  the 
analytical  form  of f(x ,  a ),  it  is  not  possible  to  find 
the  solution  of  the  problem  (6)  finding  the  solution 
of  the  system  (7).  This  happens  when  we  want  to 
determine  a  set  of  parameters  to  model  a  general 
device  with  any  kind  of  simulator:  we  run  the 
simulator  obtaining  the  macroscopic  quantities 
but  not  the  derivatives  of  the  Residual  function.  In 
this  case  we  can  resort  numerical  methods  as  the 
simplex  method  and  Powell’s  method,  which 
minimize  the  Residual  function,  without  having 
to  know  its  derivatives. 

The  Exemplar  code  runs  the  simulator  to 
calculate  the  function  f(xh  a)  (i=  1,. .  .,N)  then 
compute  and  optimize  the  Residual  function  with 
the  Simplex  or  Powell  method  (according  the  user 
choice)  and  the  procedure  restarts  until  a  tolerance 
of  the  Residual  function  is  obtained. 

The  proposed  algorithm  can  be  used  with  any 
simulators  which  can  read  their  input  from  files 
and  store  their  output  in  column-formatted  ASCII 
files.  It  was  implemented  in  a  software  program 
with  a  friendly  user  interface  based  on  X-Window, 
which  resolves  the  problem  of  the  optimization 
well,  but,  since  during  the  optimization  it  runs  the 
simulator,  it  needs  a  large  computational  time. 


3.  SIMULATION  OF  A  ONE  DIMENSIONAL 
SILICON  DIODE 

The  n+  -n-n+  diode  consists  of  two  n+  regions 
0.1  pm -long  doped  to  a  density  of  N  =  1018  cm-3, 
while  the  central  n  region  is  0.3  pm  wide,  with  a 
doping  density  of  N  =  1016  cm-3.  We  consider 
Ohmic  boundary  conditions,  T0  =  300  K,  and  1  V 
of  applied  bias;  simulations  refer  to  the  stationary 
regime. 

In  Figures  1  and  2  we  plot  the  MC  data  for  Qv 
and  Qs  and  the  fitting  with  eqs.  (2)  and  (3)  (with 
ooo):  we  see  that  our  functional  form  fits  well  the 
data.  We  observe  that  the  MC  data  are  noisy 
especially  near  the  contacts  (this  run  needed  1 


FIGURE  1  Average  rate  of  change  of  carrier  momentum  Qp 
vs.  distance:  Monte  Carlo  data  (with  ***)  and  the  fitting  formula 
Eq.  (2)  (with  ooo). 


FIGURE  2  Average  rate  of  change  of  carrier  energy  flux  Qs 
vs.  distance:  Monte  Carlo  data  (with  ***)  and  the  fitting  formula 
Eq.  (3)  (with  ooo). 


month  of  CPU  in  a  IBM  Rise  6000  590)  and 
consequently  the  fitting  coefficients  are  not  unique. 

By  using  these  coefficients  the  Extended  Hydro- 
dynamic  model  is  solved  by  using  a  simulator 
based  on  the  splitting  method  between  relaxation 
and  convection  (Tadmor  scheme)  [8].  Then  we  run 
the  Exemplar  code  in  order  to  find  the  best 
coefficients  which  fit  well  the  MC  data.  In  Figures 
3,  4  and  5  we  compare  the  MC  data  (with  ***),  the 
hydro  data  without  optimization  (with  ooo)  ,  the 
hydro  data  optimized  with  Exemplar  (with  xxx). 
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FIGURE  3  Average  drift  velocity  vs.  distance  computed  by 
using  the  hydrodynamic  simulator  without  optimization  (ooo), 
the  hydro  data  optimized  with  the  Exemplar  code  (xxx)  and 
Monte  Carlo  data  (***),  with  1  Volt  bias. 


FIGURE  4  Average  electron  energy  vs.  distance  computed  by 
using  the  hydrodynamic  simulator  without  optimization  (ooo), 
the  hydro  data  optimized  with  the  Exemplar  code  (xxx)  and 
Monte  Carlo  data  (***),  with  1  Volt  bias. 

The  optimized  coefficients  (shown  in  Tab.  I) 
differ  from  the  previous  ones  at  maximum  by  40%. 
We  notice  that  the  well  known  ‘spurious’  peak  in 
the  velocity  curve  Figure  3  is  lowered  and  the 
energy  and  heat  flux  curves  Figures  4  and  5  are 
closer  to  the  MC  data.  However  this  ’spurious‘ 
peak  cannot  be  eliminated  by  any  small  change  in 
the  parameters  and  therefore  its  persistence  calls 
for  a  more  radical  examination  of  the  basic 
assumptions  of  the  model. 


FIGURE  5  Heat-flux  vs.  distance  computed  by  using  the 
hydrodynamic  simulator  without  optimization  (ooo),  the  hydro 
data  optimized  with  the  Exemplar  code  (xxx)  and  Monte  Carlo 
data  (***),  with  1  Volt  bias. 

A  crucial  question  to  be  addressed  is  what  is  the 
range  of  validity  of  the  optimization  procedure?  In 
order  to  answer  this  question  we  simulate  the 
device  with  the  coefficients  of  Table  I  (obtained 
with  1  Volt)  for  various  biases.  The  result  is  that  in 
the  range  0.8  -f  1.2  Volt  the  hydro  simulations  fit 
well  the  respective  MC  data  (see  Figs.  6,  7  and  8  in 
case  of  1.2  V):  for  higher  biases  the  hydro 
simulations  diverge  considerably  with  respect  to 
the  MC  data. 


FIGURE  6  Average  drift  velocity  vs.  distance  computed  by 
using  the  hydrodynamic  simulator  with  transport  coefficients  of 
Table  I  (obtained  with  1  Volt)  (ooo)  and  Monte  Carlo  data  (***), 
with  1.2  Volt  bias. 
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TABLE  I  Transport  coefficients  (in  psec  !),  channel  0.3  pm,  1  Volt  bias 


a0 

ai 

a2 

bo 

bi 

b2 

Qv 

-0.1376 

-2.3042 

1.3978 

-1.6078 

0.1930 

-0.00769 

Qs 

5.7677 

27.7312 

24.1639 

-21.7514 

-2.6697 

-3.5269 

rw 

0.4094  (in  psec.) 

To 

0.0712  (in  psec.) 

FIGURE  7  Average  electron  energy  vs.  distance  computed  by 
using  the  hydrodynamic  simulator  with  transport  coefficients  of 
Table  I  (obtained  with  1  Volt)  (ooo)  and  Monte  Carlo  data 
(***),  with  1.2  Volt  bias. 


FIGURE  8  Heat  flux  vs.  distance  computed  by  using  the 
hydrodynamic  simulator  with  transport  coefficients  of  Table  I 
(obtained  with  1  Volt)  (ooo)  and  Monte  Carlo  data  (***),  with 
1.2  Volt  bias. 

Finally  we  say  that  the  modeling  of  the 
production  terms  and  the  closure  of  the  hydro 
model  are  crucial  points.  They  should  be  obtained 
by  using  the  physics  (e.g.,  an  entropy  principle, 


expansions  of  the  distribution  function).  The 
transport  coefficients  can  be  obtained  either  by 
experiments  or  by  MC  simulations,  and  their 
validity  is  restricted  to  a  neighborhood. 

We  are  trying  to  improve  our  model  and  to 
simulate  2D  devices:  work  along  this  line  is  in 
progress  and  will  presented  elsewhere. 
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A  hydrodynamical  model  for  semiconductors  based  on  Extended  Thermodynamics  is 
presented  and  a  suitable  numerical  scheme  is  proposed. 
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1  INTRODUCTION 

Hydrodynamical  models  for  carrier  transport  in 
semiconductors  have  been  widely  investigated  by 
several  authors  [1,  2,  3,  4,  5,  6  and  al.].  Most 
hydrodynamical  models  are  based  on  phenomen¬ 
ological  assumptions  and  not  derived  from  a 
consistent  physical  model.  Here  we  present  a 
hydrodynamical  model  based  on  Extended  Ther¬ 
modynamics  which  is  the  appropriate  thermody¬ 
namic  theory  of  irreversible  processes  far  from 
thermodynamic  equilibrium.  The  model  is  ob¬ 
tained  from  the  set  of  moment  equations  of  the 
Boltzmann  transport  equation  for  electrons  at  the 
level  of  13  moments.  The  closure  relations  are 
obtained  in  the  framework  of  Extended  Thermo¬ 
dynamics  [7,  8]  by  employing  the  entropy  princi¬ 
ple.  The  balance  equations  for  the  macroscopic 


variables  form  a  hyperbolic  system  for  values  of 
heat  flux  and  stress  tensor  which  are  routinely 
encountered  in  the  simulation  of  semiconductor 
devices  of  interest  for  industrial  purposes.  In  the 
present  paper  we  model  the  productions  as 
relaxation  terms,  with  relaxation  times  functions 
of  the  moments. 

A  suitable  numerical  scheme,  developed  in  [10], 
has  been  successfully  applied  to  the  present  model. 
It  is  based  on  the  splitting  method  between 
relaxation  and  convection  and  allows  an  easy 
implementation  in  numerical  codes.  For  the 
convective  steps  we  used  the  Nessyahu-Tadmor 
[11]  scheme  for  quasilinear  hyperbolic  systems  in 
divergence  form.  It  is  a  second  order  shock 
capturing  method  and  does  not  require  the 
analytical  expressions  of  the  eigenvalues  and 
eigenvectors  of  the  Jacobian  matrix  of  fluxes. 
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The  latter  property  is  crucial  because  explicit 
expressions  for  the  eigenvalues  and  eigenvectors 
are  not  available  for  the  balance  equations  of 
Extended  Thermodynamics.  For  the  relaxation 
part  a  suitable  combinations  of  implicit  Euler  steps 
assures  a  global  second  order  accuracy. 

In  particular  we  present  the  simulation  of  a 
n  +  -n  —n+  silicon  diode. 

The  numerical  results  shows  a  uniform  accuracy 
in  all  the  fields  (energy  density,  velocity,  particle 
number,  heat  flux)  within  a  reasonable  degree  of 
tolerance  and  could  be  employed  for  a  systematic 
use  as  a  CAD  tool. 


2  MATHEMATICAL  MODEL 

In  the  one  dimensional  case  the  evolution  equa¬ 
tions  become  (see  [9]  for  the  complete  derivation) 
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In  the  previous  equations,  n  is  the  electron 
number  density,  v  the  mean  velocity,  p=  nkBT  the 
hydrostatic  pressure,  q  the  x-component  of  the 
heat  flux,  a  the  xx-component  of  the  viscosity 
tensor,  nt  the  effective  electron  mass  in  the 
parabolic  band  approximation  and  E  the  x- 
component  of  the  electric  field 

FJ± 

dx 5 
d^d> 

£fa2  =  -e(ND-NA-n), 

where  (j>  is  the  electric  potential,  Nu  and  NA  are 
respectively  the  donor  and  acceptor  density,  e  the 
elemetary  charge  and  e  the  dielectric  constant. 

W  is  the  electron  energy  density  W—  (1/2) 
«v2  +  (3/2 )nkBT  and  W0  ~  (3/2)  nkBT.  For  the 
energy  relaxation  time,  rW9  and  viscous  relaxation 
timei,  ra,  we  take  the  constant  values  tw  =  0.47  ps 
and  tv =0.02  ps.  The  relaxation  times  of  momen¬ 
tum  and  energy  flux  have  been  obtained  by  the 
homogeneous  Monte  Carlo  [12]  data  by  using  the 
following  fitting  formulas  (see  [9]  for  the  value  of 
the  constants  c,-  and  ct) 


s  s2 

(6) 

Tp  —  Cl  +  c2  —  +  C3  -JJ  , 

,  _  s  _  s2 

(7) 

Tg  =  C 1  +C2-+C3  ‘ 

System  (1-5)  has  the  structure  of  a  hyperbolic 
system  of  balance  laws.  Hyperbolicity,  however,  is 
restricted  to  a  suitable  neighborhood  of  thermo¬ 
dynamical  equilibrium  (see  [9]). 

Numerical  solutions  of  the  system  are  obtained 
by  a  splitting  scheme.  Let  us  consider  a  system  of 
the  form 


dv  dF(v) 
dt  dx 


=  G(v), 


with  veRm  and  F: Rm->  Rrn. 


(8) 
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The  splitting  scheme  is  based  on  the  solution  of 
the  two  steps: 

Convection  step 


dv  dF  A 


Relaxation  step 


Each  convection  step  is  a  predictor -corrector 
scheme  of  the  form 

vj+\/2  =  2  (Vj  ”+1  +  Vy+1  +  8  ^fj  "  v/;+1^ 
-a[f(v^;/2)-F(v;+i/2)], 

v;+1/2  =  v;-^f',.  (10) 

where  A  =  dtjdx.  The  time  step  A*  must  satisfy 
a  stability  condition 

A-max/j(i4(v(x,f)))  <1.  (11) 

where  p  (x4(v(x,0))  is  the  spectral  radius  of  the 
Jacobian  matrix, 


A 


dF 

dv' 


This  condition  will  ensure  that  the  generalized 
Riemann  problems  with  piecewise  smooth  data  at 
time  tn  will  not  interfere  during  the  time  step  At. 

The  values  of  v  fj /  Ax  and  F'j /  Ax  are  a  first 
order  approximation  of  the  space  derivatives  of 
the  field  and  of  the  flux,  computed  from  cell 
averages  by  using  Uniform  Non  Oscillatory 
reconstruction  (UNO), 


Vj  =  MM  (dHv  +  i MM(Dh ,  Dj) ,  dj+ iv 


-MM(Dj,DJ+l) 


MC  o,  H5NL  - 


FIGURE  1  Average  electron  energy  vs.  distance  obtained  with 
the  hydrodynamical  model  (continuous  line)  compared  to  Monte 
Carlo  results  (circles). 


MC  o,  H5NL- 


FIGURE  2  As  in  Figure  1  for  the  average  drift  velocity  vs. 
distance. 


where 

Dj  =  vj+i  -2vj  +  Vj- 1, 

4+1/2  =  Vy+1  -  Vj 

and 

MM(x,y)  = 

f  sgn(x)  ■  min(|x|,  |j[)  if  sgn(x)  =  sgn(>>) 
1 0  otherwise. 


(12) 


A  similar  procedure  is  used  for  computing  F'j. 
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For  the  relaxation  step  an  unconditionally 
stable  second  order  scheme  can  be  obtained  by 
analytical  integration  of  the  linearized  relaxation 
equation,  where  linearization  is  obtained  by 
freezing  the  coefficients  at  time  tn.  The  electric 
potential  is  computed  by  a  standard  procedure. 
In  order  to  obtain  second  order  accuracy  in  time 
we  combinine  the  two  steps  according  to  the 
following  scheme 


Ui 

=  Un-R(UhEn,At), 

U2 

- 

U3 

=  U2-R(U3,En,At), 

U5 

CaiU3, 

En+\ 

V(U 4), 

U»+i 

U4-R{Un+l,En+\, 

where  R  represents  the  discrete  operator  corre- 
spoding  to  the  relaxation  step,  C^t  is  the  discrete 
operator  corresponding  to  NT  scheme,  and 
V(U) gives  the  solution  to  Poisson’s  equation. 


The  initial  conditions  for  the  system  are; 

w(x,0)  =  r(*,0)  =  300  K,  v(x,0) 

=  0,  q(x ,  0)  =  0,  cr(x,  0)  —  0. 

The  doping  profile  is  regularized  according  to 
the  function 


/x  ,  /  ,  X  —  X\  ,  X  —  X2\ 

no{x)  =  no  —  do  tanh - tanh -  , 

\  s  s  / 

where  «0  =  «0(0),  d0  =  n0  (1  -  ND/  N£)/ 2, 
x\  =0.1  pm,  and  X2.  —  0.4pm. 

For  the  boundary  conditions  see  [9].  The 
stationary  solution  is  reached  within  a  few 
picosends.  The  numerical  results  obtained  for  the 
stationary  case  have  been  compared  with  the 
Monte  Carlo  results,  obtained  by  the  DAMO¬ 
CLES  code  [13]. 

Comparison  with  MC  simulations  shows  a  good 
overall  agreement.  In  particular  the  agreement  is 
very  good  everywhere  in  the  device  but  in  the 
second  junction,  where  the  discrepancy  with  MC 
simulations  can  reach  up  to  10%. 


3  NUMERICAL  RESULTS 

As  test  problem  we  consider  a  ballistic  diode  n+ -n 
n+ ,  which  models  a  MOSFET  channel.  The  diode 
is  made  of  silicon,  and  the  bulk  temperature  is 
supposed  to  be  300  K.  The  n  +  regions  are  0.1  pm 
long  and  the  channel  is  0.3  pm  long.  The  doping 
density  is  iVj  =  0.5  x  1017cm-3  in  the  «  +  regions 
and  Nd=  0.2x1016  cm-3  in  the  channel. 

For  the  electron  effective  mass  in  the  approxi¬ 
mation  of  parabolic  band  we  use  0.32  me 
when  me  is  the  electron  mass.  The  silicon  dielectric 
constant  is  given  by  e  =  ereo ,  where  er  =  1 1 .7  is  the 
relative  dielecric  constant  and  e0  =  8.85  x  10_18C/ 
Vpm  is  the  dielectric  constant  of  vacuum. 

The  initial  electron  temperature  is  the  lattice 
temperature  T0=  300  K  and  the  charges  are  at 
rest.  A  bias  voltage  of  1  Volt  is  applied,  and  this 
determines  a  charge  flux  in  the  semiconductor. 
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We  derive,  using  the  Entropy  Maximum  Principle,  an  expression  for  the  distribution 
function  of  carriers  as  a  function  of  a  set  of  macroscopic  quantities  (density,  velocity, 
energy,  deviatoric  stress,  energy  flux).  Given  the  distribution  function,  we  obtain,  for 
these  macroscopic  quantities,  a  hydrodynamic  model  in  which  all  the  constitutive 
functions  (fluxes  and  collisional  productions)  are  explicitely  computed  starting  from 
their  kinetic  expressions.  We  have  applied  our  model  to  the  simulation  of  some  one¬ 
dimensional  submicron  devices  in  a  temperature  range  of  77  -  300  K,  obtaining  results 
comparable  with  Monte  Carlo.  Computation  times  are  of  order  of  few  seconds  for  a 
picosecond  of  simulation. 


Keywords:  Entropy  maximum  principle,  electron  transport,  submicron  devices 


1.  INTRODUCTION 

Modeling  of  modern  semiconductor  devices  is 
currently  performed  by  means  of  two  distinct 
approaches,  kinetic  models  and  Fluid  Dynamic 
(FD)  models.  The  most  accurate  kinetic  descrip¬ 
tion  is  given  by  Monte  Carlo  methods,  which  can 
take  into  account  explicitely  both  the  band 
structure  and  the  various  scattering  phenomena 
[1,  2].  Other  kinetic  approaches  are  based  on  the 
choice  of  particular  forms  of  the  non-equilibrium 
distribution  function  of  carriers.  Common  exam¬ 
ples  are  the  simple  shifted  Maxwellian  [3]  or  an 
expansion  of  the  distribution  in  spherical  harmo¬ 
nics  [4].  The  cylindrical  symmetry  constraint  in 


momentum  space  and  the  reduced  number  of 
terms  of  the  expansion  that  can  be  practically  used 
do  not  permit,  however,  to  describe  transport 
properties  of  carriers  in  conditions  very  far  from 
equilibrium  [5].  The  FD  models  are  obtained 
considering  a  set  of  moments  of  the  Boltzmann 
Transport  Equation  (BTE).  These  models  need  the 
knowledge  of  constitutive  functions  (fluxes  and 
collisional  productions)  present  in  the  hierarchy  of 
equations,  that  are  usually  fixed  on  a  phenomeno¬ 
logical  basis,  introducing  parameters,  such  as  e.g. 
relaxation  times  and  transport  coefficients,  which 
have  an  unknown  dependence  on  the  geometry  and 
working  conditions  of  the  simulated  devices.  The 
presence  of  these  free  parameters  [6]  has  always  been 
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a  limit  to  a  practical  use  of  FD  models,  because,  in 
general,  they  are  determined  in  each  case  on  the 
basis  of  MC  simulations  or  experimental  data. 

We  have  developed  a  HydroDynamic  (HD) 
model  for  the  simulation  of  transport  phenomena 
in  semiconductors,  based  on  the  Entropy  Maxi¬ 
mum  Principle  (EMP).  Following  this  principle, 
we  find  the  distribution  that  makes  best  use  of  the 
‘information’  deriving  from  the  knowledge  of  a 
given  set  of  moments.  This  distribution  turns  out 
to  be  a  strongly  non-linear  function  of  the 
moments.  Given  the  distribution,  we  determine 
the  unknown  constitutive  functions  appearing  in 
the  hierarchy  of  the  equations  that  describes  the 
time  evolution  of  the  moments.  We  point  out  that 
the  computation  of  collisional  productions  is  then 
based  on  the  sole  knowledge  of  the  scattering 
kernels  and  the  physical  quantities  they  contain. 
Our  HD  model  is  then  fully  closed,  and,  contrarily 
to  other  HD  models,  does  not  contain  any  free 
parameter.  On  the  other  hand,  the  distribution 
function  we  obtain  has  no  particular  symmetry 
restrictions  and  is  fully  suitable  for  three-dimen¬ 
sional  models.  Its  strong  non-linearity  is  also 
capable  of  describing  transport  phenomena  even 
in  conditions  far  from  thermodynamic  equili¬ 
brium,  as  those  present  in  submicron  devices  with 
very  high  electric  fields  and  field  gradients 
{E~  105  V/cm ,E/{dE/dx)  ~  100A). 

2.  PHYSICAL  CHARACTERISTICS 

OF  THE  MODEL 

We  consider  here  a  HD  model  for  transport 
phenomena  in  silicon.  Our  main  purpose  in  the 
development  of  this  model,  has  been  to  test  how 
accurately  our  distribution  function  describes 
strong  non-equilibrium  conditions.  Therefore  we 
have  used  a  simplified  band  structure.  As  is  well 
known,  electrons  contributing  to  transport  are 
mainly  those  belonging  to  the  six  equivalent  X 
valleys  which,  up  to  an  energy  of  about  0.5  eV,  can 
be  considered  approximately  parabolic.  Electrons 
can  then  be  described  by  a  density  of  states  effective 


mass  m*  =  0.32me.  In  the  same  energy  range,  the 
main  scattering  phenomena  are  due  to  electron- 
phonon  interactions,  and  we  will  consider  inter¬ 
valley  transitions  caused  both  by  /  type  and  g  type 
phonons;  for  intravalley  transitions  we  will  consider 
scattering  with  acoustic  phonons,  which  will  be 
regarded  as  approximately  elastic.  For  the  evalua¬ 
tion  of  the  scattering  terms  we  have  used  the 
parameters  reported  in  [2].  We  will  show  (by  a 
comparison  with  MC  simulations  performed  under 
the  same  physical  approximations)  that  in  this  way 
it  is  possible  to  describe  accurately  some  simple  Si 
devices  even  in  conditions  very  far  from  thermo¬ 
dynamic  equilibrium.  An  extension  of  the  model 
toward  a  non  parabolic  band  structure  (for  Si  or 
other  semiconductors)  and  the  inclusion  of  further 
scattering  terms  (ionized  impurities,  polar  optical 
phonons,  . . .)  do  not  present,  however,  conceptual 
difficulties,  and  they  are  at  present  in  development. 

3.  MAXIMIZATION  OF  ENTROPY 

Let  /be  the  the  distribution  function  of  electrons 
in  phase  space.  The  entropy  density  is  defined  by 

h  =  -C  jf\og{f)dk,  (1) 

where  C  is  a  constant.  Given  a  set  of  kinetic 
quantities  ^u(fc),  we  can  calculate  the  correspond¬ 
ing  moments  FA  of  the  /  by  means  of 

Fa  =  JipA(k)fdk.  (2) 

We  search  the  distribution  /  that  maximizes  the 
entropy  density  (1),  under  the  constraints  given  by 
the  relations  (2).  For  this  problem  we  have  consi¬ 
dered  the  set  of  kinetic  quantities  [7]: 
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and  the  corresponding  moments:  density  of 
carriers  (n),  flux  of  carriers  (nvt),  total  energy 
density  ( W ),  traceless  part  of  momentum  flux 
density  (£<*/)),  energy  flux  density  (Si),  being 


but  we  show  here  one-dimensional  results.  The 
only  independent  variables  are  in  this  case  FA  — 
[n,  nvx=  nv,  W ,  E(XJC)  =  E,  Sx  =  S},  satisfying  the 
following  balance  equations 


W=\p  +  \nn?v2, 
s(y)  =  aW)  +  nm\ivj)> 

St  =  qi  +  +  §  pvt  +  \nm*v2Vi 

where  v,-  is  the  mean  velocity,  p  =  nKBT  is  the 
pressure,  is  the  non  convective  part  of  tensor 
E(y*)  and  qt  is  the  heat  flux.  Application  of  the  EMP 
requires  then  the  maximization  of  the  functional 

13 

h'  =  h-J2*A 

A= 1 

where  A^’s  are  Lagrange  multipliers.  The  resulting 
distribution  will  have  the  functional  form: 

f—  exp(— n),  n  =  Y,Mk)*A.  (3) 

A 

From  (2)  we  have  that  FA  —  FA( XB),  and  so,  to 
determine  /,  we  must  invert  (2)  obtaining  XA  = 
A a(Fb )•  This  inversion  is  extremely  difficult  and 
can  be  obtained  only  by  numerical  integration  or 
by  a  series  expansion  of /[8].  We  have  followed  the 
latter  approach,  expanding  /  to  third  order  in 
Fa  =  {v,  cr,  q}  around  an  equilibrium  configura¬ 
tion  defined  by  a  local  Maxwellian  fM.  Introducing 
this  expression  into  (2)  all  the  quadratures  can  be 
done  analitically,  and  the  resulting  relations  can  be 
inverted.  In  this  way  we  express  /  as 


J i)A(k)fdk  -  Fa 


dn  dnv 

a  +  ar  =  0' 


dm  1  a(E  +  (2/3)iy) 
dt  m*  dx 


neE 


nr 


dW 

dt 


dS 

—nveE  +  Pw 
dx 


dY,  d(Znm*v3  +  f  pv  +  ^  q  +  Gy) 

~dt+  dx 

-  -\nveE  +  P^, 


dS  :  d(±nm*v4 +^qv  +  4pv2 +^av2 +  GS) 
~dt+  dx 


=  -eE[±nv2+l-!-  +  — 
nf  nf 


+  Ps, 


where  EX  =  E  is  the  electric  field,  GA  =  {Gs,  G,s}  are 
terms  of  the  fluxes  dependendig  on  higher  order 
moments  and  PA  =  {Pnv,Pw,  Ps,  Ps }  are  the 
collisional  productions.  Such  unknown  constitu¬ 
tive  functions  have  been  determined  by  integration 
of  their  kinetic  expression  using  the  distribution 
function  given  by  the  EMP.  The  fluxes  and  the 
collisional  productions  are  very  complex  functions 
of  Fa  which  we  do  not  show  here. 


5.  DEVICE  SIMULATIONS 


f(r,t,k)  =f(FA(r,t),k), 
which  is  a  strongly  non-linear  function  of  FA. 

4.  THE  HYDRODYNAMIC  MODEL 

The  model  resulting  from  the  procedure  described 
in  the  previous  section  is  fully  three-dimensional, 


As  test  case,  we  have  considered  four  unipolar  and 
one-dimensional  devices  n+nn  +  ( labelled  A,  B,  C, 
D)  at  different  temperatures,  doping  and  applied 
biases,  as  summarized  in  Table  I.  All  the  results 
are  compared  with  a  MC  model  using  the  same 
physical  approximations. 

In  Figure  (1)  we  report  the  results  of  a  series  of 
simulations  for  device  A.  We  see  that  for  devices 
with  such  high  fields  and  field  gradients  a  strongly 
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TABLE  I  Devices  parameters 


Dev. 

Temp.  (K) 

Bias  (V) 

-  N+(cm-3) 

N  (cm  3) 

Chan,  (pm) 

A 

300 

2 

1019 

1017 

0.25 

B 

150 

1 

10'8 

1016 

0.25 

C 

77 

1 

lO17 

1015 

0.45 

D 

300 

1 

1018 

10'6 

0.3 

FIGURE  1  Velocity  and  total  energy  as  a  function  of  position 
for  device  A.  We  show  a  comparison  with  MC  data  and  results 
for  the  HD  model  with  production  terms  computed  at  different 
orders  of  the  expansion.  Differences  in  the  values  of  energy  in 
the  highly  doped  regions  are  due  to  the  correlation  energy  of 
electrons,  included  in  MC  model. 


non  linear  description  is  necessary  to  obtain  a 
good  agreement  with  MC  simulations.  In  Figures 
(2),  (3)  we  report  two  examples  of  computations 
at  lower  temperatures  (with  production  term 
evaluated  at  third  order).  Note  that  MC  data  for 
the  77  K  simulation  show  evident  oscillations  on 
both  velocity  and  energy.  This  phenomenon  is 
probably  due  to  single-phonon  interactions,  and  is 
partially  reproduced  by  our  model.  Note  also  the 
ballist  pick,  characteristic  of  low-temperature 
devices.  Figure  (4)  shows  velocity,  energy,  traceless 
stress  and  heat  flux  for  device  D.  Current  densities 
for  devices  A,  B,  C  and  electric  fields  for  devices  A, 
B,  D  are  shown  in  Figures  (5),  (6).  Computation 
times  for  the  present  results  have  been  in  a  range  of 
1-^10  sec  for  a  ps  of  simulation. 


FIGURE  2  Velocity  and  total  energy  for  device  B.  Results 
from  a  MC  simulation  and  the  HD  model. 


FIGURE  3  Velocity  and  total  energy  for  device  C.  Results 
from  a  MC  simulation  and  the  HD  model. 
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FIGURE  4  Velocity,  total  energy,  traceless  stress  and  heat 
flux  for  device  D  from  a  MC  simulation  and  the  HD  model. 


FIGURE  6  Electric  fields  for  devices  A,  B  and  D,  from  the 
HD  model. 


point  out  that  an  effective  use  of  the  EMP  depends 
both  on  the  choice  of  a  set  of  ‘constraints’ 
(moments  of  the  distribution  function)  and  on 
the  determination  of  a  set  of  evolution  equations 
for  these  ‘constraints’  that  takes  explicitely  into 
account  the  underlying  physical  processes  (the 
various  scattering  phenomena,  in  this  case). 

A  better  model  of  transport  in  semiconductors 
will  require  a  more  accurate  description  of  band 
structure  and  the  inclusion  of  further  scattering 
terms.  These  extensions  are  now  in  development. 
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FIGURE  5  Current  densities  J  for  devices  A,  B  and  C,  from 
MC  (points)  and  HD  (lines). 


6.  CONCLUSIONS 

We  have  shown  that  the  EMP  allows  to  create  a 
closed  HD  model  to  describe  transport  phenom¬ 
ena  in  Si  in  strong  non-equilibrium  conditions.  We 
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Modeling  of  Poly-Silicon  Carrier  Transport  with  Explicit 
Treatment  of  Grains  and  Grain  Boundaries 
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Explicit  treatment  of  grains  and  grain  boundaries  is  necessary  to  model  the  carrier 
transport  in  poly-silicon  devices  whose  feature  size  is  comparable  to  the  grain  size.  The 
grain  boundaries  were  modeled  by  interface  traps,  and  comparison  was  made  between 
thermionic  and  diffusion  transport  across  the  grain  boundaries.  It  was  found  that  the 
numerical  model  for  diffusion  transport  with  total  trap  conservation  in  grain  boundary 
areas  is  not  physically  convergent  and  shows  a  strong  grid  sensitivity.  Effects  of  the 
critical  doping  level  and  the  lattice  temperature  are  demonstrated  on  poly-silicon 
resistors  with  1-D  bamboo-type  and  2-D  realistic  microstructures. 


Keywords:  Microstructure,  poly-crystalline  silicon,  carrier  transport,  device  simulation,  interface 
trap 


Poly-silicon  devices  such  as  TFT  for  SRAM, 
LCD,  and  image  sensor,  and  high-value  resistance 
in  analog  circuit,  have  recently  received  much 
attention  in  consideration  with  its  good  integra¬ 
tion  capability  with  standard  CMOS  technology 
[1].  The  carrier  transport  in  poly-Si  is  convention¬ 
ally  modeled  like  a  uniform  Si  material  with 
distributed  trap  density  in  the  band  gap  [2]  related 
to  the  average  grain  size,  since  more  than  tens  of 
grains  usually  exist  within  the  device  area  of 
interest  [1].  However,  with  the  aggressive  scaling  of 
VLSI  technology  and  recent  use  of  laser  annealing 
[3],  TFT  for  SRAM  and  high  resistance  poly-Si 
resistors  may  have  only  a  few  grains  within  the 
device,  and  the  device  behavior  will  be  heavily 
dependent  on  the  process  conditions  owing  to  the 
microstructure  effects  [4].  Nonuniform  and  aniso¬ 
tropic  phenomena  from  the  limited  number  of 
grains  within  the  active  device  area  may  be 


important  and  explicit  treatment  of  carrier  trans¬ 
port  within  grains  and  across  grain  boundaries  will 
be  necessary.  Physical  position  and  size  of  grains 
in  2-D  can  be  obtained  by  AFM  (atomic  force 
microscopy)  and  EBIC  (electron-beam  induced 
current)  [5]  instrumentation. 

If  the  grain  size  in  poly-Si  is  still  larger  than 
100  A,  deviation  from  single-crystal  Si  transport 
(based  on  the  periodic  potential  approximation) 
within  grains  may  be  negligible.  Bandtail  transport 
[4]  within  grains  can  be  used  as  the  next  level  of 
approximation.  The  grain  boundaries  in  silicon  are 
usually  an  interface  between  different  crystal 
orientations  and  are  therefore  atomically  thin.  It 
is  impractical  and  unphysical  to  assign  nontrivial 
regions  on  the  order  of  50  A  [3]  to  model  the 
carrier  transport  across  grain  boundaries,  since  the 
drift-diffusion  model  is  doubtful  for  that  geome¬ 
trical  scale,  and  gridding  for  multi-dimension  cases 
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requires  a  large  number  of  grid  points.  Alterna¬ 
tively,  the  grain  boundary  can  be  modeled  by 
interface  traps  (no  physical  size  assigned  to  grain 
boundaries): 

V(eVV»  =  q{n-p-N%  +  N-A-N%D  +  NAA) 

(1) 

=  (CpnP  +  enn)(NDD  -  NU)  ^ 

oi  (2) 

—  (Cnnn  +  epn ) 

^LdA  =  {Cnp»  +  epp)(NAA-NAA) 

~  (CppP  +  enp)  N AA 

where  e  is  the  dielectric  constant,  is  the  potential, 
n  and  p  are  the  electron  and  hole  concentrations, 
Aj  and  NA  are  non-trapping  ionized  dopants, 
and  N^d  and  NAA  are  donor-  and  acceptor-state 
traps.  The  effect  of  traps  are  calculated  in  rate 
equations  (2)  and  (3),  and  then  fed  back  to  the 
Poisson  equation  (1).  The  transport  equations  for 
n  and  p  follow  the  conventional  forms.  The 
interface  trap  density  is  mapped  into  the  discreti¬ 
zation  scheme  of  (1)  by  multiplying  the  half  mesh 
length  of  the  closest  cell  to  convert  into  volume 
trap  density.  Carrier  transport  across  the  grain 
boundary  can  be  modeled  by  either  thermionic 
emission  or  diffusion.  The  former  approach  has 
been  taken  in  [7],  although  a  Monte  Carlo 
simulations  is  used.  The  drift-diffusion  model  is 
much  more  computationally  efficient  and  numeri¬ 
cally  stable,  and  thus  we  have  chosen  to  implement 
the  grain  boundary  treatment  under  the  drift- 
diffusion  model.  The  thermionic  emission  model 
assumes  two  distinct  distribution  function  at  either 
side  of  the  barrier,  and  the  carrier  flux  is  estimated 
by  the  population  which  has  energy  over  the 
barrier  height.  The  barrier  height  is  corrected  to 
account  for  image  force  and  field  emission 
(tunneling),  but  no  scattering  event  is  considered 
at  the  barrier.  The  diffusion  transport  across  the 
grain  boundaries  is  similar  to  the  case  of  delta- 


doped  traps,  i.e.,  transport  is  affected  by  the 
barrier  height,  but  scattering  mechanisms  are 
homogeneously  treated.  Detailed  model  descrip¬ 
tion  can  be  found  in  classical  textbook  such  as  [8]. 

A  poly-Si  resistor  with  1-D  bamboo-like  micro¬ 
structure  will  be  used  as  the  first  example  for 
explicit  treatment  of  grain  boundaries.  Three  one- 
micron-long  grains  and  two  grain  boundaries  are 
used.  The  grain  boundary  is  modeled  by  surface 
trap  concentration  Ns  of  1012cm”2.  The  critical 
doping  level  NT  [1]  will  be  Ns/X  =  1016cm“3, 
where  A  is  the  average  grain  size.  When  the  grain  is 
doped  with  less  than  NT ,  the  grain  will  be  fully 
depleted  and  the  conduction  band  will  have  no  flat 
region.  The  barrier  height  will  reach  a  maximum 
when  the  doping  level  is  close  to  NT-  When  the 
doping  level  exceeds  Nr ,  most  of  the  traps  are 
filled,  the  barrier  height  starts  decreasing,  and 
charge  neutral  regions  (flat  conduction  band)  grow 
from  the  center  of  grain  toward  the  grain 
boundary.  This  can  be  clearly  observed  in  both 
Figures  1  and  2,  where  thermionic  (la)  and 
diffusion  (lb)  transport  have  been  demonstrated 
respectively.  However,  when  diffusion  transport  is 
used  and  the  total  number  of  traps  is  conserved  in 
the  delta-doped  representation,  the  barrier  height 
is  strongly  dependent  on  grid  spacing  since  the 
depletion  width  does  not  vary  with  N$l,  but  with 
Nq°  5  for  abrupt  junctions.  This  indicates  serious 
deficiency  of  using  the  combination  of  interface 
traps  and  diffusion  model,  since  their  physical 
concepts  of  interface  condition  conflict  each  other. 
On  the  other  hand,  when  thermionic  transport  is 
used,  the  barrier  height  shows  no  grid  dependence 
for  10  A  to  100  A  minimal  grid  spacing,  (Fig.  2). 

As  shwon  in  Figure  3(a),  the  lattice  temperature 
effect  can  also  be  well  modeled  by  the  thermionic 
transport  as  compared  to  the  experimental  results 
in  [1].  When  the  barrier  height  is  large,  the 
resistivity  is  much  more  sensitive  to  the  lattice 
temperature  TL ,  since  the  thermionic  current  is 
proportional  to  T\  exp  (— Es/k  T l ),  which  is  the 
limiting  factor  in  the  resistor  studied  here.  In 
Figures  3(b-d)  where  the  minimal  grid  spacings 
are  100  A,  50  A  and  10  A,  the  diffusion  transport 
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FIGURE  1  Conduction  band  for  various  doping  levels,  (a)  by 
thermionic  emission  and  (b)  by  diffusion.  The  critical  doping 
level  is  around  10 16  cm-3. 


FIGURE  2  Grid  dependence  of  barrier  height  when  diffusion 
transport  across  grain  boundaries  is  used.  The  thermionic 
model  shows  no  grid  dependence  for  minimal  gridding  from 
100  A  to  10  A. 

shows  not  only  strong  grid  dependence,  but  also 
inaccurate  resistivity  transition  when  the  doping 
level  is  in  the  vicinity  of  NT.  It  can  be  observed 
that  the  resistivity  has  less  doping-level  depen¬ 
dence  owing  to  the  estimation  of  barrier  height, 
and  the  slope  of  resistivity  with  respect  to  1  /TL 
does  not  show  a  gradual  change  for  doping  level 
across  NT- 
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(d)  300/T  L 
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FIGURE  3  The  lattice  temperature  effects  on  poly-Si 
resistivity,  (a)  uses  thermionic  emission,  (b),  (c)  and  (d)  use 
diffusion  transport  with  minimal  grid  spacing  of  100  A,  50  A 
and  10  A  respectively. 

A  poly-Si  resistor  with  microstructure  similar  to 
that  obtained  from  EBIC  measurement  [5]  is 
shown  in  Figure  4.  Thermionic  emission  transport 


536 


E.  C.  KAN  AND  R.  W.  DUTTON 


has  been  used  since  grid  spacing  is  much  harder  to 
control  for  arbitrary  2-D  cases.  It  can  be  seen  that 
the  current  flow  pattern  is  strongly  affected  by  the 
grain  boundaries  when  the  doping  level  is  smaller 
than  Nt,  and  less  affected  when  the  doping  level  is 
larger  than  NT  owing  to  the  barrier  height. 
Stronger  depletion  can  be  observed  at  the  grain 
boundary  corners.  The  lattice  temperature  effect 
also  shows  more  complicated  behavior  in  Figure  5, 
which  apparently  can  not  be  predicted  by  simple 
analytical  models  [1].  This  can  be  explained  by  the 
dependence  of  2-D  current  flow  pattern  on  lattice 
temperature  and  barrier  height.  Detailed  analyses 
are  limited  by  the  length  of  this  paper,  and  will  be 
included  in  future  publications. 

We  have  reported  results  for  carrier  transport 
analyses  with  explicit  treatment  of  grains  and  grain 
boundaries.  As  the  number  and  orientation  of 
grains  become  influential  for  device  electrical 
behaviors,  our  approach  shows  good  promises 
for  predictive  simulation. 


FIGURE  4  Electron  concentration  and  current  flow  pattern 
of  a  2-D  poly-Si  resistor.  The  grain  boundaries  can  be  clearly 
observed  at  the  depletion  region,  (a)  is  doped  with  1016cm-3 
and  (b)  with  1017cm-3. 
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FIGURE  5  The  lattice  temperature  effects  on  poly-Si 
resistivity  with  a  2-D  microstructure. 
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We  present  a  multi-mode  drift-diffusion  equation  as  reformulation  of  the  Boltzmann 
equation  in  the  discrete  momentum  space.  This  is  shown  to  be  similar  to  the 
conventional  drift-diffusion  equation  except  that  it  is  a  more  rigorous  solution  to  the 
Boltzmann  equation  because  the  current  and  carrier  densities  are  resolved  into  M  x  1 
vectors,  where  M  is  the  number  of  modes  in  the  discrete  momentum  space.  The  mobility 
and  diffusion  coefficient  become  MxM  matrices  which  connect  the  M  momentum  space 
modes.  This  approach  is  demonstrated  by  simulating  electron  transport  in  bulk  silicon. 


Keywords:  Semi-classical  Boltzmann  equation,  non-equilibrium  transport,  multi-mode  drift- 
diffusion 


1  INTRODUCTION 

With  the  continued  down-scaling  of  semiconduc¬ 
tor  devices,  there  is  a  need  to  develop  device 
simulators  that  can  treat  carrier  transport  taking 
into  account  off-equilibrium  carrier  distributions 
by  solving  the  Boltzmann  Transport  Equation  as 
accurately  as  possible.  Several  techniques  have 
been  developed  to  do  so -such  as  the  Monte  Carlo 
[1],  hydrodynamic  [2],  spherical  harmonic  [3],  and 
cellular  automata  methods  [4],  and  the  Scattering 
Matrix  Approach  [5,  6,  7].  Each  method  has  its 
own  limitations -for  example,  computational  bur¬ 
den,  calibration  of  parameters,  low  order  approx¬ 
imation  for  the  distribution  function,  “artificial 


diffusion”  of  carriers  and  restriction  to  fixed 
spatial  square  grids.  Ideally,  a  simulation  method 
should  provide  all  the  capabilities  of  drift-diffusion 
simulators  (i.e.,  simulations  from  equilibrium  to 
high  bias  with  smooth  results  at  low  computa¬ 
tional  burden)  while  also  resolving  carrier  distribu¬ 
tion  and  treating  scattering  processes  rigorously. 
Our  objective  in  this  paper  is  to  take  a  step  in  this 
direction. 

Therefore,  here  we  will  describe  a  re-formula¬ 
tion  of  the  scattering  matrix  equations  which 
expresses  the  1-D  spatial  Boltzmann  equation  as 
a  1-D  spatial  drift-diffusion  equation  in  a  dis- 
cretised  3-D  momentum  space.  The  current  and 
carrier  densities  generalise  to  Mxl  vectors,  where 
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M  is  the  number  of  modes  in  the  discrete  3-D 
momentum  space.  The  mobility  and  diffusion  co¬ 
efficient  become  MxM  matrices  which  connect  the 
M  momentum  space  modes.  Solving  the  Boltz¬ 
mann  equation,  then,  reduces  to  solving  a  set  of  M 
coupled  drift-diffusion  equations  which  might  be 
done  by  a  generalisation  of  the  standard  techni¬ 
ques  for  solving  the  conventional  drift-diffusion 
Equation  [8]. 


2  FORMULATION  OF  THE  MULTI-MODE 
DD  EQUATION 

The  one-mode  method  of  McKelvey  [9,  10]  and 
similar  observations  by  Shockley  [11]  can  be 
generalised  to  M-modes  in  3-D  momentum  space 
and  this  gives  us  an  expression  for  the  differential 
flux  equations  in  1-D  real  space  for  the  M-modes 
in  momentum  space  as  follows  [7] 

Kl2(x)] 

[62M] 

(1) 

where  the  Mxl  vectors  (x)  and  T*(x)  are  the 
fluxes  discretised  in  positive  and  negative  direc¬ 
tions  of  momentum  px  respectively,  at  a  position  x. 

The  elements  of  the  (MxM)  differential  matrices 
[&/(*)]  can  be  interpreted  as  the  inverse  of  an  inter¬ 
mode  mean  free  path  in  presence  of  all  possible 
scattering  mechanisms.  In  general,  these  terms  can 
be  difficult  to  calculate  analytically  and  therefore 
we  use  an  indirect  procedure.  The  space-indepen- 
dent  differential  [£y]  matrices  are  calculated  from 
the  matrix  logarithm  of  the  transmission  matrix  of 
a  semiconductor  slab  divided  by  the  slab  thickness. 
The  transmission  matrix  itself  is  obtained  from  the 
scattering  matrix  calculated  by  Monte  Carlo 
techniques  [7].  All  possible  information  about  the 
underlying  physics  of  scattering  (band  structure, 
phonons,  ionised  impurities  and  electric  field)  that 
is  included  in  the  Monte  Carlo  simulation  is 
automatically  embedded  in  the  scattering  matrix 
and  hence  in  the  [£y]  matrices. 


Now  returning  to  Eq.  (1),  we  find  its  symmetric 
and  anti-symmetric  components  and  relate  each 
flux  Ji(x)  to  its  velocity  and  its  population 
density  w,(x).  Thus,  we  obtain  a  multi-mode  drift- 
diffusion  equation  and  its  associated  continuity 
equation: 

J{x)  =  e[n(x)]  n(x)£x  +  e[D{x))  ^/?(x),  (2) 

^J(x)  =  Kx)]7(x)  +  [/3(x)]«(x),  (3) 

where  the  diffusion,  inverse  Einstein  and  mobility 
matrices  are  defined  as  follows: 


[D(x)]  =  2{[ft,(x)]  +  Mx)]  +  [&(*)] 

+  [62M]}-,[V],  () 

(5) 

+  [&(*)] -[62(*)]}, 

[#*(*)]  =  P(*)][*r\  (6) 

and  [V]  is  a  diagonal  matrix  whose  elements  are  the 
mode  velocities. 

The  coefficient  matrices  for  the  continuity 
equation  are 

[«(*)]  =  |{[6l  (*)]  +  [fl2(*)]  ^ 

-K21WM62W]}, 

[£(*)]  =f{[6i«]-[62(*)] 

-[&(*)]  +  [&(*)]}  M.  1  j 

Equations  (2)  and  (3)  are  the  key  results  of  the 
paper.  Note  that  they  are  very  similar  to  the 
conventional  drift-diffusion  form.  This  multi- 
mode  drift-diffusion  equation  has  associated 
MxM  mobility  and  diffusion  matrices  that  depend 
only  on  the  scattering  mechanisms  and  momentum 
space  discretisation. 

It  is  important  to  note  that  the  field  dependence 
of  the  relevant  matrices  here.  The  discrete  form  of 
the  Boltzmann  Transport  Equation  [7]  indicates 
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that  each  of  the  differential  [£y  (*)]  submatrices  are 
linearly  dependent  on  the  field,  for  any  given 
orientation  of  the  field.  This  relation  makes  the 
[p(x)],  [T)(x)],  [a(x)]  and  [/?(*)]  matrices  straight¬ 
forward  to  calculate  once  the  invariant  zero-field 
[£#(*)]  matrices  and  the  field-coefficient  matrices 
are  known. 


3  RESULTS 

In  this  section,  we  present  some  preliminary  results 
for  (111)  electric  fields  in  bulk  Si.  A  simple 
spherical,  non-parabolic  energy  band  structure 
was  assumed  for  calculating  scattering  rates  and 
the  3-D  momentum  space  was  discretised  into  400 
modes  -  20  modes  in  transverse  ]  pt  |  and  20  modes 
in  longitudinal  (this  implies  20  /+  fluxes  for 
+px  and  20  J~  fluxes  for  - px ).  Scattering 
matrices  were  computed  by  Monte  Carlo  simula¬ 
tion  [5-7]  and  the  [£(/(*)]  matrices  were  extracted 
using  the  procedure  described  in  Section  2.  Higher 
levels  of  accuracy  with  respect  to  the  underlying 
band-structure  and  scattering  can  be  attained  just 
by  using  more  sophisticated  Monte  Carlo  techni¬ 
ques.  The  distribution  of  the  carriers  was  assumed 
to  be  uniform  across  the  modes,  which  is  adequate 
to  demonstrate  proof-of-concept  here. 

Having  obtained  the  [f,y(x)]  matrices,  we  then 
calculated  the  mobility  and  diffusion  matrices  and 
examined  their  structure.  These  are  shown  in 
Figure  1.  Calculating  these  matrices  involves 
taking  matrix  logarithms  and  inverses  which 
produces  a  large  number  of  elements  that  are 
numerically  near  zero.  If  we  ignore  these  small 
elements,  the  matrices  are  very  sparse  («  1-2% 
ignoring  elements  <0.1%  of  the  largest  element) 
and  have  a  simple  structure.  The  most  significant 
elements  of  the  mobility  matrix  are  the  diagonal 
elements  and  off-diagonal  elements  only  where  the 
/th  mode  is  connected  to  the  j  th  mode  by  field 
shift.  The  diffusion  matrix  is  strongly  diagonal  and 
has  significant  but  small  off-diagonal  elements 
only  where  the  /th  mode  is  connected  to  the  yth 
mode  by  scattering. 


FIGURE  la  Mobility  matrix  for  Si,  non-parabolic  spherical 
bands,  20  modes  in  longitudinal  px  (20  positive  and  20  negative 
fluxes)  and  20  modes  in  transverse  \pt |,  (111)  -lxlO4  V/cm. 


FIGURE  lb  Diffusion  matrix  for  Si,  non-parabolic  spherical 
bands,  20  modes  in  longitudinal  px  (20  positive  and  20  negative 
fluxes)  and  20  modes  in  transverse  \pt\,  (111)  — lxlO4  V/cm. 


In  order  to  test  the  formulation,  we  used  the 
above  matrices  to  simulate  electron  transport  in 
bulk  Si  with  (111)  electric  fields.  The  solution  in 
bulk  is  simple  because  there  are  no  spatial 
gradients  in  Eqs.  (2)  and  (3).  Substituting  the 
expressions  for  [a],  [/?]  and  [p]  from  Eqs.  (7),  (8) 
and  (6)  respectively,  we  get 


-[£n] 

[£12] 

'[V] 

[0]' 

.-fed 

[62] . 

JO] 

M. 

which  could  also  have  been  obtained  directly  from 
the  differential  flux  Eq.  (1). 


542 


K.  BANOO  et  al 


The  solution  for  the  above  Eq.  (9)  is  a  straight¬ 
forward  solution  to  a  null-space  problem  (using 
svd  factorisation  in  MATLAB)  and  it  gives  us  the 
complete  bulk  carrier  distribution  function  for  any 
field  Sx.  Some  calculated  bulk  distributions  for 
(111)  electric  fields  are  shown  in  Figure  2.  Taking 
the  average  of  the  mobility  and  the  diffusion 
matrices  over  the  carrier  distributions  so  obtained 


FIGURE  2a  Bulk  carrier  distribution  for  <  1 1 1)  - 1  x  102  V/cm 
in  Si. 


in  the  bulk, 

J  (10) 

1  1  i  j 

(D)  =  (11) 

1  1  i  j 

we  recover  the  values  of  macroscopic  mobility  and 
diffusion  coefficients. 

Plots  of  the  results  are  shown  in  Figure  (3),  and 
we  see  that  macroscopic  mobility,  diffusion  coeffi¬ 
cients  and  electron  temperature  so  obtained  have 
the  expected  behaviour  with  electric  field.  The  low- 
field  mobility  is  ^30%  too  low  because  of  the 
assumed  uniform  intra-mode  distribution.  High- 
field  results  which  do  not  suffer  from  this 
constraint  are  therefore  more  accurate -e.g. 
9.915xl06cm/s  at  —  lxl05V/cm.  Note  that  the 
field  dependence  of  (y)  and  (D)  is  a  consequence 
of  the  field  dependence  of  the  distribution  function 
in  this  approach. 

4  DISCUSSION 

To  illustrate  how  the  equations  would  be  solved 
under  spatially  varying  conditions,  we  present  a 
simple  linear  scheme  using  finite  differences.  By 


FIGURE  2b  Bulk  carrier  distribution  for  (111)  -1x10s  FIGURE  3a  Macroscopic  bulk  mobility  versus  (111)  electric 
V/cm  in  Si.  field  in  Si. 
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FIGURE  3b  Macroscopic  bulk  diffusion  coefficient  and 
electron  temperature  (calculated  from  Dj\i  —  k^Tt/e)  versus 
(111)  electric  field  in  Si. 


discretising  dJ(x)/dx  on  a  uniform  grid  of  size  h, 
we  obtain 

\  (X/+^)  “  ^(* _^))  =  +  10WMO. 

(12) 

In  the  simplest  case,  we  could  discretise  the  current 
equations  using  Eq.  (2)  and  finite  differences  as 
follows 

+  «[O0')]~P(>'+  l)  -»(<•-  1)). 

The  result  is  a  tridiagonal  matrix  form  for  the 
carrier  distribution  function  across  the  device 
(/  from  1  to  N  ) 

[C{i-  1  Mi  -  1)  +  [T>(/)]«(0 
+  [W(z  +  1)]  n(i  +  1)  =  0, 

where  the  elements  of  the  tridiagonal  matrix  are 
now  not  scalars  but  MxM  matrices  and  the 
variables  are  Mxl  vectors.  This  is  a  large  linear 


system  whose  solution  is  the  position  dependent 
carrier  distribution  function.  However,  the  Schar- 
fetter-Gummel  method  is  normally  the  preferred 
scheme  for  discretising  the  conventional  drift- 
diffusion  equation  because  it  is  stable  when  the 
potential  drop  between  adjacent  nodes  on  a  grid 
spacing  h  is  greater  than  2 kBTJe.  A  corresponding 
result  must  be  developed  for  the  multi-mode 
semiconductor  equations. 

With  regard  to  future  applications,  it  is  clear  to 
see  that  finding  the  transitions  rates  for  the  fluxes 
in  two  (and  three)  spatial  dimensions  will  give 
similar  differential  flux  equations  for  two  (and 
three)  spatial  dimensions.  Therefore  the  multi- 
mode  method  will  still  hold  in  the  most  general 
case  of  transport.  As  a  final  note,  we  should  point 
out  that  the  multi-mode  drift-diffusion/continuity 
equation  formulation  is  formally  equivalent  to  the 
differential  flux  equations,  Eq.  (1).  These  equa¬ 
tions  could,  alternatively  be  integrated  across  the 
device  in  order  to  solve  for  carrier  transport.  The 
numerical  advantages  of  one  form  over  the  other 
are  not  clear  yet. 

5  SUMMARY 

To  summarise,  we  have  presented  a  1-D  spatial 
multi-mode  drift-diffusion  equation  as  reformula¬ 
tion  of  the  1-D  spatial  Boltzmann  equation  in  a 
discrete  3-D  momentum  space.  Although,  the 
numerical  aspects  in  this  paper  were  not  optimised 
for  the  best  accuracy,  the  multi-mode  drift- 
diffusion  equation  was  solved  in  the  bulk  for  the 
carrier  distribution  function  and  all  the  macro¬ 
scopic  properties  that  are  incorporated  as 
phenomenological  models  in  conventional  drift- 
diffusion  were  shown  to  fall  out  as  a  natural 
consequence  of  solving  the  multi-mode  drift- 
diffusion. 

The  potential  of  this  method  lies  in  its  close 
connection  to  conventional  drift-diffusion -nota¬ 
bly  the  equivalence  of  a  one-mode  case  to 
conventional  drift-diffusion,  easy  reduction  to 
macroscopic  quantities  and  similarity  in  solution 


544 


K.  BANOO  et  al 


techniques.  The  key  issue  now  is  to  formulate  the 
problem  for  solution  on  a  general  2-D  spatial  grid. 
Both  deterministic  and  stochastic  solution  techni¬ 
ques  for  solving  the  resulting  equations  should  be 
examined.  From  the  results  so  far,  the  multi-mode 
drift-diffusion  formulation  of  the  Boltzmann 
equation  promises  to  be  a  powerful  approach 
and  may  overcome  some  of  the  limitations  of  the 
scattering  matrix  approach. 
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The  Poisson  equation  is  solved  in  a  rectangular  prism  of  semiconductor  with  the 
boundary  conditions  commonly  used  in  semiconductor  device  modeling.  There  is  a 
planar  heterojunction  inside  the  prism.  The  finite  difference  formulation  leading  to  a 
matrix  of  seven  diagonals  is  used.  The  3D  version  of  the  Stone’s  method  is  applied  for 
the  iterative  solution  of  the  matrix  equation.  The  nonlinear  dependence  of  the  carrier 
concentration  on  the  electrostatic  potential  is  taken  into  account.  The  heterojunction  is 
modeled  by  a  potential  jump.  The  advantages  and  limits  of  the  method  is  presented. 


Keywords:  Poisson’s  equation,  semiconductor  device  modeling,  Stone’s  method,  heterojunction 


PROBLEM  STATEMENT 

The  solution  of  the  nonlinear  Poisson’s  equation  is 
a  fundamental  task  in  the  modeling  of  semicon¬ 
ductor  devices  [1].  The  problem  to  be  examined 
was  formulated  in  Ref.  [2].  Figure  1  shows  a 
possible  realization  in  a  rectangular  geometry.  The 
prism  is  combined  from  two  different  semiconduc¬ 
tors  contacting  in  a  planar  heterojunction.  On  the 
lateral  faces  and  on  the  base  zero  Neumann 
boundary  condition  is  supposed.  On  the  upper 
surface  on  the  rectangular  spots  (these  are  the 
exposed  surfaces)  zero  Neumann  boundary  con¬ 
dition,  on  the  metallized  remaining  part  (this  is  the 
gate)  Dirichlet  condition  is  prescribed  respectively. 
Chen  and  Porod  [2]  solved  the  problem  by  a 
coupled  finite  element/boundary  element  method. 
The  drawback  of  this  method  is  that  it  demands  a 
matrix  bandwidth  reduction  and  a  Newton  itera¬ 
tion.  The  purpose  of  the  present  paper  to  solve  the 


problem  by  the  finite  difference  method  combined 
with  a  3D  version  of  the  Stone’s  [3]  method. 


NUMERICAL  TREATMENT 

A  standard  finite  difference  discretization  is 
applied  with  variable  grid  distances.  The  boundary 
conditions  can  be  taken  into  account  in  the  usual 
manner.  At  the  zero-Neumann  boundaries  mirror 
potentials  were  used,  while  at  the  Ith  surface  point 
where  the  potential  is  prescribed  to  be  $given  the 
equation  $;  =  $given  is  valid,  i.e.,  in  the  ith  row  all 
of  the  matrix  elements  are  zero  except  the  main 
diagonal  element,  which  equal  to  unity  and  at  the 
right-hand  side  of  the  equation  in  the  column 
vector  of  the  excitation  the  /th  element  is  $given- 
Continuously  numbering  the  grid  points  a  sparse 
matrix  with  seven  diagonals  is  resulted. 
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FIGURE  1  The  structure  to  be  analyzed.  The  upper  part  is  a 
top  view,  while  the  lower  part  is  a  side-view.  The  small  squares 
on  the  top  view  are  the  exposed  surfaces. 
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FIGURE  2  The  potential  jump  is  modeled  by  a  double  layer 
The  Gauss- theorem  is  applied  to  the  volume  element. 


calculated  as  the  potential  differences  divided  by 
the  appropriate  distances.  So  and  all  of  the  six 
neighbouring  potentials  appear  in  the  /h  equation 
as  in  an  “ordinary”  point  (without  surface  charge). 
The  given  charge  enclosed  by  the  volume  will  be  an 
element  of  the  right-hand  side  column  vector.  The 
distance  An  is  choosed  0,01th  part  of  the  neigh¬ 
bouring  (to  the  surface  perpendicular)  grid  dis¬ 
tances. 

The  treatment  of  the  potential  dependent  carrier 
density  is  according  to  [5].  This  is  fitted  itself 
naturally  to  the  iterative  procedure  of  Stone. 


The  heterojunction  is  characterised  by  a  poten¬ 
tial  jump  [4],  This  condition  must  be  built  into  the 
sytem  of  equations  so  that  no  excess  matrix 
elements  to  be  appeared.  We  enforce  the  potential 
jump  by  a  double  layer.  The  connection  between 
the  moment  of  the  doble  layer  and  the  given 
potential  jump  is 

a  An  =  eA4> 

where  a  is  the  surface  charge  density,  An  is  the 
distance  between  the  positive  and  negative  layer,  e 
is  the  permittivity  and  A3>  is  the  potential  jump 
characterizing  the  heterojunction  (Fig.  2).  Using 
the  Gauss-theorem  f.e.  for  the  upper  volume 
element  in  Figure  2  the  field  strengthes  can  be 


CONVERGENCE  PROPERTIES 

The  numerical  results  are  calculated  under  the 
following  data  (see  Fig.  1):  the  depth  of  the  heavily 
doped  semiconductor  S\  =  10  nm,  the  depth  of  the 
substrate  1s,2=110nm,  the  side  length  of  the 
squares  of  the  satellite  excited  surfaces 
.S3  =  10  nm,  the  side  length  of  the  rectangular  prism 
y4=  120  nm,  the  distance  between  two  satellite 
excited  surface  5’5  =  40nm,  the  side  length  of  the 
square  of  the  central  excited  surface  s6  =  10  nm,  the 
donor  density  of  the  heavily  doped  part 
VD=  1018cm"3,  the  acceptor  density  of  the  sub¬ 
strate  VA=1015cnT3,  the  voltage  characterizing 
the  Schottky  contact  is  0,8  V,  the  potential  jump  at 
the  heterojunction  is  0,3  V.  The  gate  voltage 
is  -IV. 
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When  the  number  of  unknowns  exceeds  a 
critical  level  the  convergence  speed  degrades 
drastically  (Fig.  3).  35  min  CPU  time  is  required 
for  the  1700  iteration  on  a  HP  9000  715/80 
workstation.  As  it  is  known  the  a  iteration 
parameter  of  the  Stone’s  method  is  an  important 
but  theoretically  not  well  established  quantity. 
Most  authors  using  the  method  have  their  own 
recipe  for  choosing  the  iteration  parameter.  We 
used  the  following  formula  proposed  by  Stone  [3] 

a  =  1  -  be 

where  e  varied  cyclically  with  the  values  0,6;  0,4; 
0,2;  0,8;  0,2;  1.  b  generally  equal  to  10“3  but  in  the 
case  of  the  curve  a)  in  Figure  3  we  had  to  change  it 
to  10“2  at  the  iteration  number  100  otherwise 
divergence  occured. 


RESULTS 

The  electron  potential  energy  was  calculated  at  the 
substrate  side  of  the  heterojunction  Figure  4  shows 
a  contour  plot,  while  Figure  5  a  surface  plot  of  the 


FIGURE  3  The  residual  versus  the  iteration  number.  In  the 
inlet  the  numbers  of  grid  point  can  be  seen. 


FIGURE  4  Contour  plot  of  the  electron  potential  energy. 
One  satellite  dot  and  a  quarter  of  the  central  dot  can  be  seen. 


FIGURE  5  Surface  plot  of  the  electron  potential  energy .  One 
satellite  dot  and  a  quarter  of  the  central  dot  can  be  seen. 


energy.  The  diagrams  presented  are  calculated 
under  the  grid  dimension  59x59x59^205  000.  In 
the  vertical  direction  in  the  heavily  doped  part  50 
grid  points  was  applied  uniformly  and  only  9  one 
in  the  substrate,  the  distances  of  the  latter  are 
increasing  as  a  geometrical  series.  Using  the 
symmetry  only  one  quarter  of  the  arrangement 
was  calculated. 
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By  the  method  the  ratio  of  the  depth  of  the 
central  and  satellite  valleys  and  the  absolute  depth 
of  the  valleys  can  be  calculated.  These  parameters, 
which  are  important  in  the  charging  and  rechar¬ 
ging  process  can  be  tailored  by  the  proper  choice 
of  the  geometry  and  the  gate  voltage. 


CONCLUSIONS 

The  electrostatic  problem  of  a  semiconductor 
quantum  device  with  exposed  surface  was  solved 
by  the  finite  difference/Stone’s  method. 

The  advantages  of  the  method  are:  simple 
programing,  low  storage  requirement,  the  iteration 
demanded  by  the  nonlinearity  can  be  naturally 
built  into  the  iteration  type  solution. 

The  drawbacks  are:  after  a  limit  the  convergence 
becomes  rather  slow,  there  is  no  possibility  to  the 
local  refinement  of  the  grid. 

The  application  of  a  more  advanced  iterative 
solver  [6]  is  under  development. 
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We  examine  an  alternative  layout  geometry  for  the  quantum  cellular  automata  (QCA) 
architecture.  In  the  traditional  QCA  geometry,  all  of  the  cells  are  placed  in  a  single 
plane,  so  that  each  cell  interacts  with  a  particular  neighbor  only  along  one  of  its  edges. 
By  rotating  the  cells  out  of  the  plane,  we  make  it  possible  for  neighbors  to  interact  along 
all  four  edges  at  once.  This  increased  interaction  leads  to  a  more  bistable  cell-cell 
response  function  and  a  50%  higher  excitation  energy.  We  also  present  a  majority  logic 
gate  designed  using  three-dimensional  cells. 


Keywords:  Quantum  cellular  automata,  computer  architecture,  quantum-dot  molecules 


1.  INTRODUCTION 

The  quantum  cellular  automata  (QCA)  architec¬ 
ture,  which  is  composed  of  arrays  of  Coulombi- 
cally  coupled  quantum  dot  molecules,  is  a 
revolutionary  new  paradigm  for  quantum  comput¬ 
ing  [1].  QCA’s  offer  new  possibilities  for  comput¬ 
ing  at  the  nanometer  scale,  since  they  can  be  used 
to  hierarchically  implement  any  logical  function 
[2-4]. 

In  this  paper,  we  present  an  alternative  layout 
geometry  for  the  QCA  architecture  by  rotating  the 
cells  out  of  their  plane.  In  the  next  section,  we 
briefly  review  the  QCA  paradigm  in  two  dimen¬ 
sions,  while  in  Section  3  we  examine  the  QCA 
architecture  in  three  dimensions.  By  generalizing 
QCA  cells  to  three  dimensions  and  computing  the 


resulting  cell-cell  response  curve,  we  show  that 
the  three-dimensional  cells  are  more  bistable  than 
the  traditional  two-dimensional  cells. 

In  Section  4,  we  examine  a  three-dimensional 
majority  logic  gate.  We  show  that  two-dimen¬ 
sional  and  three-dimensional  majority  gates  give 
identical  functional  results. 


2.  TWO-DIMENSIONAL  QCA  DEVICES 

Figure  1  shows  the  basic  building  block  in  the 
QCA  paradigm,  the  quantum  dot  molecule  cell. 
Four  quantum  dots  are  spaced  close  enough  for 
electrons  to  tunnel  between  neighboring  pairs  of 
dots.  Two  stable  states  are  possible  with  this 
configuration.  The  first,  corresponding  to  an 
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FIGURE  1  A  schematic  of  the  basic  QCA  cell.  Electrons 
interact  Coulombically  and  tunnel  among  the  four  sites. 
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FIGURE  2  The  cell-cell  response  function  of  two  neighbor¬ 
ing  QCA  cells.  The  bistable  nature  of  the  response  function 
allows  us  to  encode  a  binary  bit  in  each  cell. 


electron  occupying  the  upper  right  hand  and  lower 
left  hand  dots,  represents  a  polarization  of  + 1. 
The  other,  corresponding  to  an  electron  in  the 
upper  left  hand  and  lower  right  hand  dots, 
represents  a  polarization  of  —  1.  These  two 
polarizations  can  represent  binary  values  of  “1” 
and  “0”,  respectively. 

The  Hamiltonian  used  to  describe  each  cell 
includes  Coulombic  interaction  between  each  pair 
of  sites,  near-neighbor  tunneling,  next-near-neigh¬ 
bor  tunneling,  on-site  energies  and  on-site  char¬ 
ging  costs.  The  on-site  energy  of  each  site  will  be 
affected  by  the  polarization  of  its  neighboring  cells. 

Thus,  Coulombic  forces  cause  the  polarization 
of  one  cell  to  affect  the  polarization  of  neighboring 
cells.  The  cell-cell  response  curve  shown  in  Figure  2 
is  computed  by  solving  the  two-particle,  time- 
independent  Schrodinger  equation  for  the  first  cell 
in  the  presence  of  the  second  cell.  The  curve  shows 
that  the  response  of  such  cells  is  very  bistable, 
which  means  that  for  even  very  small  polarizations 
in  one  cell,  the  second  cell  is  completely  polarized. 
This  feature  is  very  attractive,  since  it  allows  these 
cells  to  encode  a  single  binary  bit  in  their  quantum 
state.  Furthermore,  when  propagating  signals,  the 
signal  level  is  restored  to  the  maximum  polariza¬ 
tion  at  every  stage. 

The  most  useful  logic  structure  in  the  QCA 
paradigm  is  the  three-input  majority  gate.  A 


majority  gate  is  one  in  which  the  output  agrees 
with  the  majority  of  the  inputs.  This  logic  structure 
is  essential  because  we  can  produce  both  AND  and 
OR  gates  from  it.  A  two-input  OR  gate  can  be 
produced  by  setting  one  of  the  three  inputs  equal 
to  1,  while  a  two-input  AND  can  be  produced  by 
setting  one  of  the  three  inputs  equal  to  0. 

Figure  3  shows  a  schematic  for  the  coplanar 
QCA  majority  gate.  In  this  figure,  inputs  A,  B  and 
C  are  used  as  inputs.  The  output  is  propagated 
down  the  line  of  cells  acting  as  a  binary  wire. 
The  ground  state  of  this  multiple-cell  structure  was 


Input  A 


FIGURE  3  The  coplanar  majority  voting  gate.  The  device  cell 
always  matches  a  majority  of  the  three  fixed  neighbors. 
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computed  by  a  self-consistent  solution  of  the 
Schrodinger  equation. 


3.  AN  ALTERNATIVE  CELL  GEOMETRY 

The  QCA  paradigm  presented  thus  far  has 
included  only  cells  lying  in  a  two-dimensional 
plane.  By  generalizing  the  architecture  to  three 
dimensions,  we  can  show  that  the  cell-cell  interac¬ 
tion  is  significantly  stronger  than  in  the  two- 
dimensional  case. 

A  schematic  of  our  three-dimensional  cell-cell 
layout  is  shown  in  Figure  4.  Again,  four  dots  are 
used,  but  in  this  case,  the  cells  are  facing  each 
other  so  that  all  four  of  the  dots  are  strongly 
interacting  with  the  neighbor. 

The  cell-cell  response  curve  of  this  layout  is 
shown  in  Figure  5.  The  intercellular  distance  was 
computed  such  that  the  centers  of  the  cells  were  at 
the  same  distance  as  the  centers  of  the  two- 
dimensional  calculation.  The  code  to  solve  the 
time-independent  Schrodinger  equation  was  gen¬ 
eralized  to  allow  for  Coulombic  interactions  in  all 
three  dimensions. 

It  should  be  noted  that  three-dimensional 
neighbors  tend  to  align  opposite  one  another,  so 
the  cell-cell  response  function  was  horizontally 


FIGURE  4  The  cell  layout  for  the  three-dimensional  cell-cell 
response  function  calculation. 


FIGURE  5  ►Cell-cell  response  function  of  the  three-dimen¬ 
sional  geometry.  In  addition  to  yielding  a  more  bistable 
response  function,  use  of  the  alternative  geometry  increases 
the  excitation  energy  by  50%. 


flipped  to  allow  easier  comparison  of  the  two 
curves.  As  with  the  “inverter  chain”  of  the  two- 
dimensional  QCA  devices,  this  anti- alignment 
behavior  can  be  overcome  by  using  an  even 
number  of  cells  to  transmit  information.  Of 
course,  this  will  also  make  the  implementation  of 
three-dimensional  inverters  quite  simple. 

From  the  curves  shown,  it  is  easy  to  see  that  the 
three-dimensional  curve  is  more  bistable  than  the 
two-dimensional  curve.  In  addition,  the  eigenvalue 
splitting  between  the  ground  state  and  the  first 
excited  state  is  50%  larger  in  the  three-dimensional 
cells  than  it  is  in  the  two-dimensional  cells.  This  is 
to  be  expected,  since  in  the  three-dimensional  case, 
all  four  dots  are  Coulombically  interacting,  while 
in  the  two-dimensional  case,  only  the  near¬ 
neighbor  dots  strongly  interact  with  each  other. 

Since  the  excitation  energy  between  the  ground 
state  and  the  excited  states  is  the  quantity  that 
controls  the  time-dependent  evolution  of  the 
system,  we  can  expect  that  this  alternative 
geometry  will  propagate  signals  more  quickly  than 
the  two-dimensional  cells.  In  addition,  the  thermo¬ 
dynamic  behavior,  the  adiabatic  switching  speed, 
and  the  maximum  operating  temperature  are  also 
dependent  on  the  excitation  energy,  so  similar 
improvements  can  be  expected  in  those  areas,  as 
well. 
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4.  A  THREE-DIMENSIONAL 

MAJORITY  GATE 

As  mentioned  in  the  previous  section,  the  most 
important  logical  device  of  the  QCA  paradigm  is 
the  majority  logic  gate.  Figure  6  demonstrates  the 
layout  of  a  QCA  majority  gate  in  three  dimen¬ 
sions.  Cells  1  and  3  are  coplanar  to  cell  4,  so  they 
tend  to  drive  it  to  match  their  polarization.  On  the 
other  hand,  cell  2  drives  cell  4  to  an  opposite 
polarity,  since  it  interacts  using  the  three-dimen¬ 
sional  cell-cell  response  function.  This  can  be 
cancelled  by  using  an  extra  cell  in  that  input,  or  it 
can  be  used  to  implicitly  invert  that  input. 

In  order  to  ensure  that  the  three  inputs  “vote” 
with  equal  weighting,  the  distances  between  cells 
were  chosen  such  that  when  all  four  neighbors  of 
the  device  cell  are  voting  in  opposite  directions,  the 
device  cell  exhibits  no  polarization. 

To  find  this  distance,  the  three  coplanar  cells 
(the  device  cell  and  two  inputs)  were  placed  a 
distance  60  nm  apart  from  each  other  and  the  two 
coplanar  drivers  were  set  with  P=  + 1.  The  other 
two  neighbors  were  then  set  with  P=  — 1  and 
moved  away  from  the  device  cell  until  it  exhibited 
zero  polarization.  At  that  point,  we  were  certain 
that  the  horizontal  and  vertical  neighbors  were 
having  the  same  impact  on  the  polarization  of  the 
device  cell.  The  distance  between  the  non-coplanar 
cells  at  which  this  occurred  was  1.7435  times  larger 


Cells 


FIGURE  6  A  schematic  representation  of  the  three-dimen¬ 
sional  majority  logic  gate.  Cells  1,  2  and  3  are  drivers,  while  cell 
4  is  the  device  cell  and  cell  5  is  the  output.  Cells  2  and  4  tend  to 
anti-align,  so  the  input  at  cell  2  is  implicitly  inverted. 


than  the  distance  between  the  coplanar  cells,  since 
the  non-coplanar  cells  interact  more  strongly  with 
each  other. 

We  used  the  Intercellular  Hartree  Approxima¬ 
tion  to  find  the  ground  state  of  this  multiple-cell 
system.  The  three  input  cells  were  fixed  at  a  specific 
polarity  (either  +1  or  -1),  while  the  device  cell 
and  the  output  cell  were  allowed  to  change. 

We  tested  each  of  the  eight  combinations  of 
driver  polarizations  and  found  that  in  each  case, 
the  ground  state  corresponds  to  the  correct  result 
for  a  majority  logic  gate.  The  state  of  the  output 
always  matches  the  state  of  a  majority  of  the  three 
inputs,  assuming  the  non-coplanar  input  has  been 
implicitly  inverted. 


5.  CONCLUSIONS 

We  have  proposed  an  alternative  geometry  for  the 
QCA  computing  architecture  and  we  have  shown 
that  this  geometry  achieves  a  better  cell-cell 
response  than  the  two-dimensional  implementa¬ 
tion.  We  have  also  shown  that  an  inverter  and  a 
majority  logic  gate  can  be  constructed  three- 
dimensionally,  so  any  combinational  function 
can  be  constructed  using  this  alternative  geometry. 
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Electrostatic  Formation  of  Coupled  Si/SiC>2 
Quantum  Dot  Systems 

PER  HYLDGAARD*,  HENRY  K.  HARBURY  and  WOLFGANG  POROD 

Department  of  Electrical  Engineering,  University  of  Notre  Dame,  Notre  Dame,  IN  46556 


We  present  three-dimensional  numerical  modeling  results  for  gated  Si/Si02  quantum 
dot  systems  in  the  few-electron  regime.  In  our  simulations,  the  electrostatic  confining 
potential  results  from  the  Poisson  equation  assuming  a  self-consistent  Thomas-Fermi 
charge  model.  We  find  that  a  very  thin  Si02  top  insulating  layer  allows  an  effective 
control  with  single-electron  confinement  in  quantum  dots  with  radius  less  than  lOnm 
and  investigate  the  detailed  potential  and  resulting  charge  densities.  Our  three- 
dimensional  finite-element  modeling  tool  allows  future  investigations  of  the  charge 
coupling  in  gated  few-electron  quantum-dot  cellular  automata. 


Keywords:  3D  simulation  and  modeling,  finite  element  method,  silicon/silicon-dioxide  quantum 
dots,  quantum-dot  cellular  automata 


We  present  numerical  simulations  for  electrostati¬ 
cally  confined  few-electron  quantum  dot  systems  in 
the  technologically  important  Si/Si02  material 
system.  Our  emphasis  is  modeling  a  possible  so- 
called  Quantum-dot  cellular  automata  (QCA)  struc¬ 
ture  [1]  in  which  a  bi-stable  occupation  by  two 
excess  electrons  in  a  small  and  strongly  charge- 
coupled  quantum-dot  system  defines  logic  0/1. 

The  bottom  panel  of  Figure  1  shows  a  schematic 
of  the  Si/Si02  material  system:  a  thin  silicon- 
dioxide  layer  serves  as  excellent  insulation  of  the 
bottom  silicon  slab  from  the  set  of  top  gates. 
Applying  finite  biases  at  these  gates  allows  the 
formation  of  electrostatically  confined  quantum 


dots  just  below  the  heterostructure  interface  (at 
z  =  z0).  Mesoscopic  transport  investigations  in 
gate-induced  quantum-dot  arrays  [2]  documents 
the  feasibility  of  fabricating  few-electron  quan¬ 
tum-dot  systems  in  the  Si/Si02  material  system. 
This  development  has  in  turn  resulted  in  a 
proposal  for  room-temperature  single-electron  Si/ 
Si02  memory  cells  [3]. 

Previous  (two-dimensional)  modeling  results 
[4, 5]  of  few-electron  Si/Si02  quantum  dot  systems 
exploited  an  axial  symmetry  to  investigate  the 
electrostatic  confinement  within  an  individual  dot. 
Encouraged  by  recent  three-dimensional  modeling 
of  larger  quantum-dot  systems  [6]  we  investigate 
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FIGURE  1  Schematics  of  possible  gate-confined  Si/Si02 
quantum-dot  cellular  automata  (QCA)  The  lower  panel  shows 
material  composition:  thin  (5  nm)  Si02  insulating  layers  grown 
on  a  Si  block  with  an  unintentional  p  —  1015  cm-3  /7-doping.  Top 
gates  define  quantum  dot  system  just  below  the  hetero-interface 
at  z  —  zq.  The  upper  panel  illustrates  potential  QCA  cell 
formation:  group  of  four  positive  small  attractor  gates  (shaded 
circles)  from  charge-coupled  system  of  close  quantum  dots.  The 
entire  QCA  cell  is  surrounded  by  negative  depletion  gate  (also 
shaded)  to  enhance  confinement. 


here  a  possible  realization  of  a  Si/Si02  quantum- 
dot  cellular  automata. 

The  upper  panel  of  Figure  1  shows  schematics  of 
the  small  but  experimentally  accessible  top-gate 
geometry  assumed  for  our  model  Si/Si02  QCA 
system.  A  negative  depletion  gates  (shaded)  sur¬ 
rounds  a  group  of  four  positive  attractor  gates 
(shaded  circles)  with  a  r  =  10  nm  radius  and  mutual 
separation  of  30  nm.  The  attractor  gates  are  biased 
to  about  1 V  to  ensure  a  single-electron  equilibrium 
occupation  in  each  of  the  four  electrostatically 
confined  quantum  dots.  For  the  top  silicondioxide 
layer  we  assume  a  5  nm  thickness. 


In  our  T=100K  simulations,  the  confining 
potential  is  obtained  from  the  Poisson  equation 
within  a  self-consistent  Thomas-Fermi  charge 
model.  The  silicon  is  assumed  to  have  a  small 
unintended  but  fully  ionized  doping,  p=  1015cm“3 
and  to  ensure  convergence  we  investigate  a  1.5  nm 
thick  bottom  slab  with  in-plane  extension  of 
approximately  300  nm  by  300  nm.  The  top  metal 
depletion  and  attractor  gates  are  described  by 
Dirichlet  boundary  conditions.  For  the  exposed 
Si02  surface  we  assume  for  simplicity  a  potential 
fixed  at  the  mid-gap  Si02  value,  that  is,  again  a 
Dirichlet  boundary  condition. 

Our  finite-element  calculation  uses  a  129  by  129 
nonuniform  grid  to  allow  a  1  nm  resolution  from 
the  surface  and  well  below  the  Si/Si02  interface, 
that  is,  around  the  quantum  dot  system.  The  top 
panel  of  Figure  2  shows  how  most  nodal  layers  (at 
constant  z)  are  connected  in  a  mesh  with  alternat¬ 
ing  tetrahedron  orientation  to  eliminate  a  geome¬ 
trical  bias.  The  bottom  panel  of  Figure  2  illustrates 
the  repeated  thinning  of  our  finite-element  mesh 
undertaken  deep  below  the  interfaces  where  a  high 
in-plane  resolution  is  no  longer  needed.  However, 
our  numerical  simulation  still  involves  6xl05 
nodal  points  for  which  we  determine  the  electro¬ 
static  potential  within  the  self-consistent  Thomas 
Fermi  screening  model.  Using  the  Newton-Ralph- 
son  procedure  we  solve  in  each  iteration  the 
resulting  huge  linear  system  using  a  quasi-minimal 
residual  implementation  [7]. 

Figure  3  shows  our  finite-element  determination 
of  the  electron  potential  and  charge  distribution 
along  the  interface  (z  =  z0)  for  our  possible  QCA 
realization.  The  electron  confinement  potential 
{top  panel)  is  calculated  relative  to  the  Fermi 
energy  with  negative  values  corresponding  to  a 
strongly  enhanced  electron  concentration  ne]  ( bot¬ 
tom  panel).  Note  how  the  thin  silicon-dioxide  top 
layer  allows  a  very  well-defined  set  of  quantum 
dots  with  a  finite  inter-dot  potential  barriers.  The 
total  equilibrium  occupation  is  set  to  exactly  four 
electrons  and  the  strong  Coulomb  blockade  effect 
will  prevent  multiple  occupation  of  the  individual 
quantum  dots. 
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FIGURE  2  Three-dimensional  finite-element  modeling  of 
gated  Si/Si02  quantum-dot  systems.  The  upper  panel  shows 
tetrahedron-based  mesh  with  alternating  cell  orientation  used  in 
most  in-plane  layers  (i.e.,  with  constant  z)  to  eliminate  geo¬ 
metric  bias.  The  lower  panel  illustrate  repeated  mesh  thinning  at 
layers  deep  below  the  interface  (z  <C  z0)  used  to  obtain  a  factor- 
of-two  reduction  in  numerical  complexity. 


Figure  4  shows  our  modeling  results  for  the 
simpler  double-quantum-dot  system  in  which  we 
are  preparing  to  investigate  the  mutual  charge 
coupling  between  the  quantum  dots.  We  assume 
again  r=10nm  attractor  gates  with  a  mutual 
separation  of  30  nm  and  adjust  the  positive  bias  to 
achieve  a  single-electron  equilibrium  occupation  of 
each  of  the  quantum  dots.  The  upper  panel  shows 
the  variation  of  the  confinement  potential  both 
along  the  axes  (x)  connecting  the  two  quantum 
dots  and  in  the  growth  direction  (z).  The  hetero¬ 
structure-cut  panel  illustrates  the  excellent  top- 
gate  control  of  the  electrostatic  potential  into  the 
Si/Si02  slab  well  below  the  heterostructure  inter¬ 
face,  z  =  z0. 

The  lower  panel  of  Figure  4  shows  the  corre¬ 
sponding  equilibrium  charge  distribution,  nQ\. 


Interface  Potential 


In-Piono  dir.  x  [ran] 


FIGURE  3  Electron  potential  and  charge  distribution  of 
possible  QCA  cell  at  the  heterostructure  interface  (z  =  z0). 
Potential  is  calculated  for  an  experimentally  accessible  gate 
geometry:  four  attractor  gate  of  radius  r=  lOnm  with  mutual 
30  nm  separation.  The  upper  panel  shows  potential  dips 
(measured  relative  to  Fermi  energy)  with  corresponding 
contour  plot  and  illustrates  the  crisp  gate  control  allowed  by 
the  thin  Si02  top  layer.  The  electron  potential  confines  in 
equilibrium  exactly  four  electrons  with  very  well-defined  charge 
distribution  ne]  ( lower  panel). 

Note  that  this  electron  distribution,  i.e.,  the 
equilibrium  quantum  dot,  is  confined  within  1  nm 
of  the  interface  and  about  5  nm  of  the  attractor- 
gate  center.  Future  modeling  will  investigate  the 
charge  coupling  of  such  quantum-dot  disks  in  the 
presence  of  the  attractor  and  depletion  gates. 

In  summary,  we  have  presented  three-dimen¬ 
sional  finite-element  calculations  for  gate-confined 
few-electron  Si/Si02  quantum-dot  systems.  We 
have  documented  the  feasibility  of  crisp  electro¬ 
static  gate  control  for  also  few-electron  quantum- 
dot  systems  and  have  investigated  the  detailed 
charge  distribution  and  confinement  potentials. 
Our  modeling  tool  allows  future  investigations  of 
charge  coupling  in  few-electron  quantum-dot 
cellular  automata  structures. 
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FIGURE  4  Growth  direction  (z)  variation  of  confinement 
potential  {upper  panel )  and  corresponding  (equilibrium)  charge 
distribution  (lower  panel )  in  simpler  double-dot  model  system. 
This  system  is  defined  by  two  r  =  10  nm  attractor  dots  also  with 
a  mutual  separation  of  30  nm.  All  z-positions  are  given  relative 
to  the  interface  z  =  z0.  Note  that  the  equilibrium  electron 
distribution  ne]  is  confined  within  1  nm  of  the  interface  and 
about  5  nm  of  the  attractor-gate  center. 
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Electron  transport  properties  of  quantum  wires  in  the  presence  of  a  periodically 
modulated  magnetic  field  are  investigated.  For  a  short  modulated  wire,  we  find  dips  in 
conductance  just  below  each  mode  threshold.  The  conductance  dips  are  quite  robust  at 
low  temperature.  Increasing  the  number  of  periods  of  magnetic  modulation  can  lead  to 
the  formation  of  minibands  and  gaps.  The  differences  between  the  one  dimensional  (ID) 
electric  superlattice  and  ID  magnetic  superlattice  are  discussed.  We  also  consider  the 
spatial  distributions  of  currents,  which  show  dramatic  differences  between  the  magnetic 
superlattices  and  electric  ones. 


Keywords:  Two-dimensional  electron  gas,  magnetotransport,  nanostructure,  quantum  wire, 
superlattice,  miniband 


Recently,  much  attention  has  been  paid  to  electron 
transport  in  magnetic  fields  that  are  inhomogeneous 
on  the  nanometer  scale.  Experimentally,  such  mag¬ 
netic  modulation  can  be  realized  by  laying  metallic 
strips  on  the  surface  of  the  heterostructure,  consist¬ 
ing  of  either  superconducting  or  ferromagnetic 
material  [1,  2,  3],  or  by  varying  the  topography  of 
a  non-planar  two-dimensional  electron  gas  (2DEG) 
grown  at  a  GaAs/AlGaAs  heterojunction  [4]. 

In  this  paper  we  point  out  that  the  experimental 
study  of  a  new  class  of  semiconductor  nanostruc¬ 
tures,  the  ID  magnetic  superlattices,  is  now  within 
reach  and  we  present  theoretical  predictions  of 
their  transport  properties. 


We  consider  a  model  of  a  two-dimensional 
ballistic  quantum  wire  shown  schematically  in 
Figure  1 .  This  quantum  wire  has  infinite  length  in 
the  x  direction  and  is  confined  laterally  -Wj 
2  <y<  Wjl  by  hard  wall  potentials.  Inside  the 
channel  the  potential  is  taken  to  be  zero.  Our 
system  consists  of  a  finite  magnetic  superlattice  and 
two  straight  leads.  A  uniform  background  field  B0 
is  applied  to  the  entire  sample,  while  a  finite  section 
of  length  L  =  Na  is  subject  to  an  additional 
periodically  varying  magnetic  field  with  amplitude 
Bm  perpendicular  to  the  plane  [5,  6]. 

For  a  quantum  wire  without  modulation,  it  is 
well-known  that  conductance  is  quantized  in  units 
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FIGURE  1  Schematic  view  of  a  quantum  wire  of  width  W 
with  hard-wall  confinement  in  the  transverse  direction.  A  finite 
section  is  subject  to  a  periodically  varying  magnetic  field,  with 
profile  shown  in  the  lower  graph. 


of  2e2//z ,  due  to  propagation  of  a  set  of  discrete 
modes  through  the  wire.  As  the  magnetic  modula¬ 
tion  is  turned  on,  the  transport  will  be  affected  by  the 
oscillations  of  B(x ),  so  some  change  in  the  character 
of  the  conductance  is  expected.  We  show  in  Figure  2 
the  conductance  of  the  quantum  wire  as  a  function 
of  kFW/7r  for  N=  3,  Bm=\2T  and  Bo  =  03T , 
respectively.  In  the  case  of  weak  modulation,  the 
conductance  still  shows  a  staircase  structure  with 
plateaus  very  close  to  the  quantized  values  2 e2njh  (n 
being  the  number  of  subbands  below  EF)  as  shown 
in  Figure  2(a).  Notice  that  there  are  dips  in 
conductance  just  below  the  threshold  of  each  mode. 
Similar  dips  have  also  been  found  in  the  conduc¬ 
tance  of  perturbed  quantum  wires  subjected  to  local 
potential  disorder,  or  a  potential  barrier,  or  coupled 
to  a  resonant  cavity,  rather  than  magnetic  modula¬ 
tion.  Tekman  and  Bagwell  [7]  and  Berthod  et  al  [8] 
have  argued  that  sub-threshold  dips  like  these  are 
Fano  resonances;  [9]  that  is,  they  are  due  to  coupling 
between  a  discrete  state  and  the  continuum  in  which 
it  is  situated.  They  have  shown  how  the  nearby 
threshold  can  induce  a  discrete  bound  or  anti-bound 
state  and  so  give  rise  to  the  dip. 

The  coupling  between  localized  states  and 
propagating  modes  can  be  revealed  by  the  local 
current  distribution.  In  the  non-magnetic  case,  it 


FIGURE  2  Conductance  as  a  function  of  kFW/n  for  N=  3, 
Bm  =  1.2  T and  B0  0.3  T.  (a)  and  (b)  correspond  to  a  =  0.5 W and 
0.67  IF,  respectively. 

has  been  found  that  vortices  in  the  current  flow 
reverse  their  sense  of  rotation  as  the  energy  is 
raised  through  a  resonance  level  [8,  10].  We  looked 
at  current  flows  at  a  number  of  energies  around 
the  first  conductance  dip  in  Figure  2(a).  Away 
from  the  dip,  the  flow  is  laminar  on  either  side.  In 
a  narrow  energy  window,  close  to  the  dip,  a 
number  of  vortices  appear,  indicating  the  presence 
of  a  localized  state.  In  our  example,  with  three 
periods  of  oscillation,  there  was  one  large  vortex  in 
each  period.  These  move  a  little  as  the  energy  is 
raised,  but  the  sense  of  rotation  did  not  change. 
Rather,  the  dip  was  associated  with  one  vortex 
moving  out  of  the  scattering  region,  leaving  a 
current  pattern  transverse  to  the  channel  along  the 
end  of  the  scattering  region,  accounting  for  the 
reduced  transmission.  Since  the  three  main  vor- 
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tices  are  counter-rotating,  transmitted  electrons 
follow  a  meandering  path  around  them.  The  lack 
of  reversal  is  understandable  as  these  vortices  are 
driven  by  the  applied  magnetic  field. 

When  the  background  field  B0  is  increased,  the 
conductance  dips  shift  to  higher  values  of  kF  W/n 
due  to  the  one-dimensional  subbands  in  the  leads, 
which  gradually  depopulate  as  an  increasing, 
perpendicular  magnetic  field  is  applied  (not  shown 
here).  In  Figure  2(b)  the  length  of  the  period  a  is 
0.61  W.  Lengthening  the  scattering  region  provides 
thicker  magnetic  barriers,  which  tends  to  reduce 
the  transmission.  We  see  that  as  a  increases,  the 
conductance  dips  become  more  pronounced.  We 
also  get  some  conductance  oscillations  between  the 
dips  as  we  increase  a ;  these  are  due  to  multiple 
reflections  within  the  scattering  region. 

The  temperature  dependence  of  the  conductance 
dips  is  shown  in  Figure  2(b).  We  find  that  the 
conductance  dips  are  quite  robust  at  low  tempera¬ 
tures.  The  dotted  line  in  Figure  2(b)  shows  the 
conductance  at  T  =  4  Kelvin.  The  oscillations  due 
to  multiple  reflections  within  the  scattering  region 
disappear,  but  the  conductance  dips  are  still  appa¬ 
rent. 

Figure  3  shows  the  conductance  of  a  modulated 
quantum  wire  with  N=4  and  8  unit  cells, 
respectively.  For  a  wire  with  N=  4  unit  cells  the 
conductance  shows  an  indication  of  miniband 
formation  in  a  periodic  structure.  Although  the 
basic  features  of  the  formation  of  minibands  and 
gaps  are  already  present  for  wires  with  a  few  unit 
cells,  some  narrow  and  fine  minibands  and  gaps 
reveal  themselves  only  for  a  rather  long  modulated 
wire,  as  seen  in  Figure  3  for  the  wire  with  N =8 
periods. 

In  Figure  4  we  show  the  calculated  conductance 
of  a  wire  with  N=16  and  the  individual  transmis¬ 
sion  coefficients  for  each  incident  mode.  The 
conductance  displays  dips  due  to  the  formation 
of  minigaps  and  rapid  oscillations  due  to  electron 
transmission  through  the  coupled  quasi-zero- 
dimensional  states  in  the  cavity  regions  between 
the  magnetic  barriers.  Each  group  of  conductance 
oscillations  evolves  into  a  continuous  miniband  in 


EF(meV) 

FIGURE  3  Conductance  vs  Fermi  energy  EF  for  quantum 
wires  with  N=  4  and  8,  respectively.  Curve  for  N=  8  is  vertically 
offset  for  clarity. 


the  limit  of  an  infinitely  long  superlattice.  The 
formation  of  minibands  and  minigaps  in  a  finite 
lateral  surface  superlattice  has  been  reported  using 
split-gate  structures,  in  which  the  ID  lateral 
surface  superlattice  is  realized  by  a  periodic 
modulation  of  constriction  width  instead  of 
magnetic  modulation  [11]. 

In  the  electric  modulated  case,  the  number  of 
oscillations  in  the  first  conduction  plateau  is  equal 
to  the  number  of  cavity  regions  [12].  However,  in 
our  case  the  channel  is  modulated  by  the  magnetic 
field.  We  can  see  that  there  is  no  simple  direct 
correspondence  between  the  number  of  unit  cells 
and  the  number  of  oscillations.  This  result  shows 
that  the  effective  potential  of  the  magnetic  barriers 
for  electron  motion  in  the  wire  is  complicated  [13]. 
The  oscillations  in  higher  plateaus  are  more 
irregular  because  of  the  strong  coupling  between 
modes.  The  peak  values  of  the  conductance 
oscillations  should  be  an  integer  multiple  of 
le2lh.  Some  of  the  peaks  fail  to  reach  a  quantized 
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FIGURE  4  (a)  Conductance  as  a  function  of  Fermi  energy  EF 
for  a  finite  period  magnetic  superlattice  with  16  unit  cells. 
a  =  90 nm,  W=  100 nm,  B0  =  0  and  Bm=  1 . 5T.  (b)  Transmission 
coefficients  of  individual  modes  for  the  magnetic  modulated 
quantum  wire. 

value  due  to  the  limited  number  of  data  points. 
The  peaks  are  very  narrow  and  therefore  require 
more  extensive  computing.  We  find  that  the 
oscillations  become  more  prominent  as  a  becomes 
longer.  It  is  thus  suggested  that  the  formation  of 
an  effective  loop  of  edge  states  is  essential  for  the 
conductance  oscillations  as  discussed  by  Takagaki 
[14]  and  Yoshioka  [15].  One  is  then  dealing  with  an 
Aharonov-Bohm  type  interference. 

The  conductance,  plotted  in  Figure  4(a),  steps 
up  and  down  between  quantized  levels.  Conduc¬ 
tance  quantization  is  related  to  the  band  structure 
for  the  corresponding  infinite  system  [12].  How¬ 
ever,  none  of  the  individual  modal  transmission 
coefficients  in  Figure  4(b)  shows  quantization  by 
itself.  See  for  example  the  region  near  9  meV.  The 
quantization  occurs  as  the  various  modes  are 
mixed  by  the  periodic  modulation. 


The  calculated  conductance  as  a  function  of  the 
amplitude  Bm ,  is  displayed  in  Figure  5(a).  The 
conductance  steps  down  by  two-units  of  2 e2jh  and 
up  by  one-unit,  then  down  to  zero  as  Bm  is 
increased.  Transmission  coefficients  Tt  of  indivi- 
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FIGURE  5  (a)  The  conductance  as  a  function  of  the  magnetic 
modulation  amplitude  Bm  for  a  quantum  wire  with  16  unit  cells. 
a- 90  nm,  W=  100  nm,  i?0  =  0  and  EF=  12meV.  (b)  Transmis¬ 
sion  coefficients  Tt  of  individual  modes  for  the  magnetic 
superlattice.  The  curves  are  offset  for  clarity,  (c)  The 
conductance  as  a  function  of  the  electric  modulation  amplitude 
Vm  for  a  finite  period  electric  superlattice.  The  other  parameters 
are  the  same  as  (a),  (d)  Transmission  coefficients  7)  of 
individual  modes  for  the  electric  superlattice.  The  solid,  dotted, 
dashed  and  long-dashed  curves  correspond  to  i  =  1,  2,  3  and  4, 
respectively.  The  arrows  A  and  B  indicate  the  cases  correspond¬ 
ing  to  the  current  distributions  shown  in  Figures  6(a)  and  6(b), 
respectively. 
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dual  modes  for  the  magnetic  superlattice  are 
shown  in  Figure  5(b);  the  curves  are  vertically 
offset  for  clarity.  Although  the  total  conductance, 
which  is  the  sum  of  the  modal  transmissions,  is 
essentially  quantized,  the  individual  transmissions, 
plotted  in  Figure  5(b),  show  complicated  depen¬ 
dence  on  Bm. 

We  also  calculated  the  conductance  for  an 
electric  superlattice  with  amplitude  Vm  and  sinu¬ 
soidal  barriers.  The  geometry  of  the  structure  is 
otherwise  the  same  as  that  of  Figure  5(a).  The 
conductance  for  a  ID  electric  superlattice  steps 
down  monotonically  as  the  propagating  modes  in 
the  leads  are  blocked  by  the  potential  barriers  with 
increasing  electric  modulation  amplitude  Vm,  as 
shown  in  Figures  5(c)  and  5(d).  No  two-unit  drop 
is  seen  for  increasing  electric  modulation  ampli¬ 
tude  Vm  in  Figure  5(c).  Clearly,  the  unique  features 
of  the  wave-vector-dependent  effective  potential 
for  the  magnetic  barriers  are  critical  for  the  two- 
unit  drop  in  Figure  5(a).  In  our  calculations,  the 
two-unit  drop  in  conductance  with  increasing  Bm  is 
predominant  but  not  universal;  occasionally  we 
find  cases  with  a  one-unit  drop.  The  reason  for  this 
is  that  the  effective  potential  of  the  magnetic 
barriers  for  electron  motion  along  the  quantum 
wire  depends  on  the  mode  of  the  incident  electron 
in  the  leads  [13]. 

Figures  6(a)  and  6(b)  show  the  current  densities 
in  one  unit  cell  of  the  magnetic/electric  super¬ 
lattices  for  the  cases  labelled  by  A  and  B  in  Figures 
5(a)  and  5(c),  respectively.  Figure  6(a)  corresponds 
to  the  current  pattern  for  the  case  that  there  is  one 
propagating  mode  in  the  sample.  The  current 
pattern  clearly  shows  that  the  Lorentz  force  allows 
electrons  to  move  in  a  meandering  way.  At 
sufficiently  high  magnetic  fields,  the  current  flow 
displays  a  complex  pattern.  Figure  6(b)  shows  the 
current  distribution  for  an  electric  superlattice 
with  sinusoidal  barriers.  Notice  that  in  this  case, 
the  current  flows  straight  across  the  cell.  This 
illustrates  that  electron  transport  in  this  electric 
superlattice  is  equivalent  to  a  one-dimensional 
system,  in  contrast  to  the  magnetic  one,  in  which  it 
is  not. 


FIGURE  6  The  current  density  distributions  (a)  and  (b) 
corresponding  to  the  cases  labelled  A  and  B  indicated  in  Figures 
5(a)  and  5(c),  respectively. 

In  conclusion,  we  have  studied  the  transport 
properties  of  a  quantum  wire  in  spatially  periodic 
magnetic  and  electrostatic  fields.  We  found  a 
conductance  dip,  an  example  of  a  Fano  resonance, 
just  below  each  mode  threshold.  These  conduc¬ 
tance  dips  shift  to  higher  values  of  kF  W  ji r  when 
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the  background  field  is  increased.  As  the  length  a 
of  the  period  increases,  the  conductance  dips 
broaden  and  the  dips  for  higher  modes  become 
more  pronounced.  The  conductance  dips  are  quite 
robust  at  low  temperatures.  We  have  shown  that 
many  features  of  electronic  transport  in  ID 
magnetic,  superlattices  differ  from  those  of  the 
electrostatically  modulated  ones.  We  find  that 
unlike  the  electric  superlattice,  the  number  of 
oscillations  in  the  first  conduction  plateau  has  no 
simple  direct  correspondence  to  the  number  of  unit 
cells  N.  The  conductance  of  ID  magnetic  super¬ 
lattices  does  not  decrease  monotonically  with 
increasing  Bm  but  rather  steps  up  and  down 
between  quantized  levels,  sometimes  going  to  zero. 
The  spatial  distributions  of  current  density  show 
dramatic  differences  between  the  magnetic  and 
electric  superlattices. 

This  complex  behaviour  suggests  that  the 
experimental  study  of  ID  magnetic  superlattices 
should  be  very  interesting.  Several  methods  are 
available  to  produce  such  devices,  including  the 
laying  down  of  patterned  superconducting  strips 
on  top  of  a  gated  device. 
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Boundary  Condition  for  the  Modeling  of  Open-circuited 
Devices  in  Non-equilibrium 
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A  boundary  condition  specifically  designed  to  model  open-circuited  devices  in  a 
macroscopic  device  simulator  is  introduced.  Other  simulation  techniques  have  relied  on 
an  external  circuit  model  to  regulate  the  current  flow  out  of  a  contact  thus  allowing  the 
potential  to  remain  the  controlled  variable  at  the  boundary.  The  limitations  of  these 
methods  become  apparent  when  modeling  open-circuited  devices  with  an  exceptionally 
small  or  zero  output  current.  In  this  case,  using  a  standard  ohmic-type  Dirichlet 
boundary  condition  would  not  yield  satisfactory  results  and  attaching  the  device  to  an 
arbitrarily  large  load  resistance  is  physically  and  numerically  unacceptable.  This 
proposed  condition  is  a  true  current  controlled  boundary  where  the  external  current  is 
the  specified  parameter  rather  than  the  potential.  Using  this  model,  the  external  current 
is  disseminated  into  electron  and  hole  components  relative  to  their  respective 
concentration  densities  at  the  contact.  This  model  also  allows  for  the  inclusion  of 
trapped  interface  charge  and  a  finite  surface  recombination  velocity  at  the  contact. 

An  example  of  the  use  of  this  boundary  condition  is  performed  by  modeling  a  silicon 
avalanche  photodiode  operating  in  the  flux  integrating  mode  for  use  in  an  imaging 
system.  In  this  example,  the  device  is  biased  in  steady-state  to  just  below  the  breakdown 
voltage  and  then  open-circuited.  The  recovery  of  the  isolated  photodiode  back  to  its 
equilibrium  condition  is  then  determined  by  the  generation  lifetime  of  the  material,  the 
quantity  of  signal  and  background  radiation  incident  upon  the  device,  and  the  impact 
ionization  rates. 


Keywords:  Current  boundary  condition,  macroscopic  simulation,  charge  storage,  avalanche 
photodiodes 


1.  INTRODUCTION 

Many  macroscopic  device  simulators  provide  for  a 
connection  to  an  external  circuit  primarily  through 


either  ohmic  or  Schottky  contacts.  In  these  cases, 
the  system  variables  are  constructed  such  that  the 
electrostatic  potential  is  the  independent  para¬ 
meter  describing  the  boundary  condition,  and 
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other  values  such  as  the  current  flowing  through 
the  contact  are  calculated  in  a  post-processing  step 
[1,  2],  These  boundary  conditions  are  adequate  for 
situations  where  the  device  can  be  approximated 
as  a  voltage  controlled  current  source;  however, 
instances  exist  where  this  may  not  be  the  case. 
Most  notably  this  occurs  when  the  energy  stored 
within  the  internal  capacitance  of  a  device  becomes 
the  dominant  controlling  factor  for  the  prediction 
of  the  potential  across  the  device.  In  these  cases, 
the  use  of  a  current  controlled  boundary  is  more 
applicable. 

A  situation  requiring  a  current  controlled 
contact  exists  when  simulating  the  recovery  to 
equilibrium  of  an  open  circuited  device.  Such  a 
condition  occurs  when  examining  the  performance 
of  an  avalanche  photodiode  cpnAgured  in  the 
charge  storage  mode  for  imaging  applications 
[3,  4],  In  this  configuration,  the  sensor  is  periodi¬ 
cally  reset  by  applying  a  large  reverse  bias  to  the 
device.  Following  the  reset,  the  device  is  open- 
circuited  and  begins  to  equilibrate  through  carrier 
generation.  If  the  device  is  used  as  a  photodetec¬ 
tor,  light  shining  upon  the  structure  generates 
excess  charge  which  is  then  stored  within  the 
depletion  region  of  the  device.  At  the  end  of  the 
integration  cycle,  the  quantity  of  stored  charge 
within  the  device  is  used  as  an  indication  of  the 
total  illumination  upon  the  pixel  [5], 

The  present  work  introduces  a  current  con¬ 
trolled  boundary  condition  which  is  capable  of 
simulating  an  electrically  isolated  photodiode. 
With  this  model,  a  steady-state  voltage  is  applied 
between  the  ohmic  contacts  using  the  traditional 
Dirichlet  boundary  condition.  The  device  is  then 
isolated  from  the  external  circuit.  The  voltage 
across  the  device  is  no  longer  fixed  invalidating  the 
use  of  a  voltage  controlled  boundary  condition. 
Alternatively,  a  current  controlled  boundary  con¬ 
dition  must  be  employed  to  describe  the  device 
which  includes  effects  such  as  surface  recombina¬ 
tion,  trapped  surface  charge,  and  leakage  current 
from  the  read-out  electronics.  Application  of  this 
model  to  a  generic  APD  is  also  presented. 


2.  BOUNDARY  CONDITION 


A  current  controlled  boundary  condition  is 
required  to  accurately  model  the  reverse  recovery 
within  a  photodiode.  When  the  diode  is  discon¬ 
nected  from  the  external  circuit  or  driving  bias,  the 
voltage  at  the  cathode  floats.  The  metal  contact 
can  no  longer  be  treated  in  its  usual  manner  as  an 
infinite  source  of  carriers  and  the  assumption  that 
the  electron  and  hole  concentrations  are  dictated 
by  their  equilibrium  values  may  not  be  valid. 
Moreover,  the  voltage  across  the  device  is  deter¬ 
mined  solely  by  the  distribution  of  charge  and  the 
internal  capacitance  of  the  diode  and  not  by  the 
external  circuit.  Simulation  techniques  which 
operate  by  allowing  the  potential  to  remain  the 
independent  variable,  while  acceptable  for  model¬ 
ing  switching  characteristics,  are  inappropriate  for 
cases  with  either  very  low  or  zero  external  current. 
Attempting  to  model  the  open-circuited  device 
with  the  standard  ohmic  condition  and  a  very  large 
shunt  load  resistance  is  unacceptable  on  physical 
as  well  as  numerical  grounds. 

The  primary  concern  with  developing  the  current 
controlled  model  lies  in  the  apportioning  of  the 
electron  and  hole  fluxes  from  the  external  current  at 
the  contact.  It  is  assumed  here  that  the  total  current 
is  divided  into  the  electron  and  hole  currents 
relative  to  their  respective  densities  at  the  contact. 
Thus,  the  relation  between  the  currents  is  given  by, 


n+p 


Jr 


n+p 


'  Jn  —  QR surf 


(1) 


where  J„  and  J p  are  the  electron  and  hole  currents 
leaving  the  contact  as  indicated  in  the  illustration 
of  the  discretized  control  volume  as  shown  in 
Figure  1  [6],  The  carrier  concentrations  are  taken 
to  be  those  of  the  neighboring  control  volume 
within  the  device.  /?surf  is  the  recombination  rate 
owing  to  surface  states  at  the  contact  [7]  using  the 
standard  relation: 


■^surf  — 
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FIGURE  1  Illustration  of  the  sample  control  volume  at  the 
surface  of  the  simulation  domain.  Current  continuity  requires 
the  conservation  of  flux  within  the  control  volume.  Here,  the 
current  boundary  is  entered  directly  into  the  formulation  where 
the  sum  of  the  electron  and  hole  currents  balance  that  of  the 
external  current. 


where  S„  and  Sp  are  electron  and  hole  recombina¬ 
tion  velocities.  Additionally,  Eqs.  (3)  and  (4)  below 
require  that  the  carrier  currents  sum  to  the 
external  current  and  that  the  band  bending  at  the 
contact  is  proportional  to  the  trapped  interface 
charge. 


Jn~^~  Jp  — 


(3) 


m  -n 
£  ~dn  ~ 


(4) 


Combining  Eqs.  (1)  and  (3)  yields  expressions 
setting  the  values  of  the  current  into  a  contact. 


Jp  —  <7^  surf  +  ,  ^ext 

fl  |  JJ 

(5) 

n 

Jp  —  -^surf  +  *  •'ext 

IT  "i  P 

(6) 

These  currents  are  used  directly  within  the  control 
volume  formulation  of  the  discretized  continuity 
equations  [8],  Furthermore,  partial  derivatives  of 
the  current  expressions  with  respect  to  the  system 
variables  are  analytically  obtained  thus  leading  to 
increased  convergence  of  the  non-linear  system. 

A  two-dimensional  analysis  of  the  photodiode  is 
necessary  to  study  the  full  carrier  transport  of 
effects  such  as  premature  edge  breakdown  and 
pixel  crosstalk;  however,  this  study  is  concerned 


solely  with  the  examination  of  the  charge  storage 
capabilities  of  the  APD.  Therefore,  a  one-dimen¬ 
sional  analysis  is  satisfactory.  This  greatly  simpli¬ 
fies  the  model  by  allowing  the  external  current 
density,  Jex „  to  be  directly  proportional  to  the 
cross  sectional  area  of  the  device  contact  and  the 
total  current.  Multi-dimensional  extensions  of  this 
boundary  can  be  produced  by  either  imposing  an 
additional  constraint  upon  how  the  output  cur¬ 
rents  sum  over  the  range  of  the  contact  such  as 
equipartitioning  or  by  solving  Gauss’  law  describ¬ 
ing  the  current  flow  through  the  metal  comprising 
the  contact  [9]. 

To  study  the  charge  storage  of  the  photodiodes, 
the  current  boundary  condition  is  incorporated 
into  a  drift-diffusion  simulator  which  self-consis- 
tently  solves  Poisson’s  equation  and  the  current 
continuity  equations  [10].  Since  the  smallest 
dimensions  of  the  device  structures  are  on  the 
order  of  microns,  it  is  believed  that  hot  carrier 
transport  can  be  neglected  in  these  devices  and 
that  a  full  hydrodynamic  simulation  is  unneces¬ 
sary.  Within  this  model,  the  generation-recombi¬ 
nation  rates  include  terms  for  SRH,  Auger,  and 
radiative  recombination,  impact  ionization,  and 
wavelength  dependent  photoillumination.  Addi¬ 
tionally,  the  standard  field  dependent  mobility  and 
impact  ionization  models  for  silicon  are  incorpo¬ 
rated  into  the  simulator  [1 1].  By  using  a  completely 
numerical  model,  many  of  the  non-linear  attri¬ 
butes  of  the  carrier  transport  of  a  photodiode  in 
charge  storage  mode  can  be  included  without  the 
use  of  many  of  the  simplifying  approximations 
typically  employed  [3,  4]. 

3.  EXAMINATION  OF  CHARGE  STORAGE 
IN  PHOTODIODES 

To  demonstrate  the  utility  of  the  current  con¬ 
trolled  boundary  condition,  the  reverse  recovery  of 
an  avalanche  photodiode  designed  for  night-sky 
imaging  is  examined.  Here  the  device,  shown  in 
Figure  2,  is  a  reach-through  avalanche  photodiode 
50  pm  in  length.  During  the  initial  reset  period,  a 
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FIGURE  2  Doping  profile  and  geometry  of  reach-through 
avalanche  photodiode  used  to  study  the  current  boundary 
condition. 


large  electric  field  is  established  in  the  /7-type 
multiplication  region  and  a  modest  drift  field  exists 
in  the  intrinsic  region.  The  illumination  spectrum 
used  in  this  study  is  that  of  a  cloudless,  moonless, 
night  sky  with  a  total  illumination  of  5.4xl016 
photons  per  cm2  over  a  range  of  wavelengths  from 
0.3  to  1.1  microns  [12].  The  total  internal  quantum 
efficiency  for  this  spectrum  is  approximately  55%. 

If  the  photodetector  was  continuously  biased, 
the  photo-generated  carriers  would  be  swept 
across  the  depletion  region,  collected  at  the  device 
contacts,  and  interact  directly  with  the  external 
circuit.  In  the  charge  storage  mode  of  operation, 
the  connection  to  the  external  circuit  is  removed. 
In  this  case,  the  internally  stored  electric  field  still 
causes  the  carriers  to  be  swept  across  the  depletion 
region  where  they  may  impact  ionize.  However, 
the  charge  is  stored  near  the  edges  of  the  depletion 
region  as  opposed  to  interacting  with  the  external 
circuit.  This  accumulated  charge  counteracts  the 
depleted  charge  from  the  reset  and  tends  to  drive 
the  reverse  recovery.  Additional  factors  which 
promote  the  recovery  of  the  diode  include  the 
leakage  current  inherent  in  the  readout  electronics, 


*^ext?  the  dark  current  due  primarily  to  thermal 
generation  of  carriers  from  SRH  centers,  and 
surface  recombination. 

Figure  3  shows  the  dynamics  of  the  charge 
storage  within  the  APD  for  various  photoillumi¬ 
nation  intensities.  For  this  case,  intensities  ranging 
from  0.05  to  500  times  the  nominal  night  sky 
spectrum  are  considered  (2.7xl015-  2.7xl019 
photons  per  cm2  per  second).  The  initial  bias  on 
the  device  is  established  to  produce  an  initial 
steady-state  gain  of  fifteen.  The  first  part  of  the 
voltage  recovery  is  dominated  by  the  influence  of 
holes  filling  the  intrinsic  region  resulting  in  a  linear 
decay  of  internal  bias  with  only  a  very  small 
degradation  in  gain.  Once  the  drift  region  has  been 
filled,  the  charge  is  stored  at  the  edge  of  the  high 
field  multiplication  region  giving  rise  to  the 
expected  parabolic  voltage  recovery.  A  non-linear 
decrease  in  the  internal  gain  is  observed  as  the  high 
field  region  recovers.  Under  extreme  illumination 
conditions,  the  diode  completely  recovers  to  its 
equilibrium  level  and  begins  to  become  forward 
biased.  Recombination  processes  prevent  the 
diode  from  continued  storage  of  charge  once  the 
diode  becomes  sufficiently  forward  biased. 


0  2  4  6  8  10 


time  (msec) 

FIGURE  3  Calculated  reverse  bias  recovery  of  the  diode 
shown  in  Figure  2  as  a  function  of  integration  time.  The 
different  curves  represent  separate  values  of  photoillumina- 
tion. 
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FIGURE  4  Calculated  stored  electron  charge  of  the  device 
shown  in  Figure  2  as  a  function  of  integration  time  for  various 
illumination  intensities. 


Typically,  the  actual  stored  charge  rather  than 
the  device  bias  is  quantified  at  the  end  of  the 
integration  period  to  determine  the  level  of 
photoillumination  [5].  Figure  4  illustrates  the  total 
electron  charge  stored  within  the  diode  as  a 
function  of  the  integration  time.  As  expected,  the 
first  part  of  the  integration  gives  rise  to  a  linear 
collection  of  charge  with  a  constant  gain.  As  the 
amount  of  charge  increases,  the  gain  begins  to 
saturate  with  a  square  root  dependence  [10,  13]. 
Once  the  device  has  recovered  to  the  point  where 
internal  gain  becomes  negligible,  the  linear  recov¬ 
ery  is  again  observed.  As  the  diode  becomes 
significantly  forward  biased,  as  in  an  open- 
circuited  solar  cell,  the  charge  saturates. 


4.  CONCLUSIONS 

A  current  controlled  boundary  condition  applic¬ 
able  to  macroscopic  device  simulators  is  presented. 
This  boundary  type  establishes  the  systems  vari¬ 
ables  such  that  the  external  current  becomes  the 
independent  quantity  at  the  contact,  thus  allowing 
the  contact  potential  to  be  self-consistently  set 
within  the  simulation.  As  an  example  of  this 


model,  a  reach-through  avalanche  photodiode, 
operating  in  the  charge  storage  mode,  is  examined. 
Here  it  is  observed  that  the  model  allows  one  to 
capture  the  full  transient  recovery  of  the  device 
from  the  initial  reset  point  all  the  way  through  the 
forward  bias  condition  in  the  case  of  extreme 
photoillumination. 
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may  be  typewritten  on  a  separate  page,  with  their  designated  positions  specified  in  the  text.  For 
manuscripts  with  extensive  equations,  TFX  is  the  preferred  word  processing  program.  To  simplify 
typesetting  of  non-Tj.X  files,  please  use:  (1)  the  “exp”  form  of  complex  exponential  functions;  (2) 
fractional  exponents  instead  of  root  signs;  and  (3)  the  solidus  (/)  to  simplify  fractions— e.g.,  exp  x1/2. 

Marking:  The  typesetter  will  set  mathematical  symbols  in  italics,  except  for  obvious  groups  like  sin  and 
log.  Any  symbols  which  are  left  in  roman  (upright)  type  should  be  circled' in  pencil  in  the  typescript  and 
bold  symbols  should  be  underlined  with  a  wavy  line. 

Units:  Metric  units  should  be  used,  with  inch/pound  equivalents  in  parantheses. 

REFERENCES 

Indicate  references  in  the  text  by  consecutive  arabic  numerals  enclosed  in  square  brackets,  viz.  De  Pauw 

[1].  All  sources  cited  in  the  text  must  be  included  in  the  reference  list,  and  vice  versa.  The  full  list  of 
references  should  appear  on  a  separate  page  at  the  end  of  the  paper.  Below  are  examples  of  entries  for  a 
journal,  a  proceedings,  and  a  book: 

[1]  B.  Moszkowski,  “A  Temporal  Logic  for  Multilevel  Reasoning  about  Hardware,”  Computer  Magazine 
February  1985. 

[2]  M.C.  McFarland,  A.C.  Parker  and  R.  Camposano,  “Tutorial  on  High-Level  Synthesis,”  Proc.  of  25th 
Design  Automation  Conf,  1988. 

[3]  M.  Born  and  E.  Wolf,  Principles  of  Optics,  Oxford:  Pergamon  Press,  Sixth  Ed.  1986. 

It  is  assumed  that  with  the  development  of  the  World  Wide  Web  (WWW),  authors  and/or  the  publisher 
will  propose  distribution  of  articles  or  parts  of  articles  on  the  WWW.  If  the  HTTP  address  of  a  referenced 
article  is  known,  please  include  it  at  the  end  of  the  reference  in  the  following  style: 

<  www  http://www.blouk.com/article.html  > 
where  http://www.blouk.com/article.html  is  the  HTTP  address. 

BIOGRAPHY 

At  the  end  of  the  manuscript  please  include  a  one-paragraph  biographical  sketch,  100  words  maximum, 
highlighting  your  education,  experience,  and  other  qualifications  for  writing  this  particular  paper.  There 
should  be  one  biographical  sketch  for  each  author.  Material  submitted  may  be  edited  by  the  publisher. 

The  following  information  is  needed  (for  each  author):  (1)  Current  research  interests,  or  what  is 
currently  being  worked  on;  (2)  Pertinent  extracurricular  activities— for  example,  do  you  sit  on  any 
committees  or  have  you  chaired  any  symposia?;  (3)  Books  or  articles  that  you  have  written/are  writing 
that  are  especially  noteworthy;  (4)  Professional  memberships  or  honors;  (5)  Current  position;  (6)  Any 
electronic  network  connection  (i.e.,  Bitnet). 
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PRADIP  K.  SRIMANI  has  served  on  the  faculty  of  the  Indian  Statistical  Institute,  Calcutta;  Gesselschaft 
fuer  Mathematik  und  Dalenverarbeitung,  Bonn,  Germany;  the  Indian  Institute  of  Management,  Calcutta; 
and  Southern  Illinois  University,  Carbondale.  Currently,  he  is  a  Professor  of  Computer  Science  at 
Colorado  State  University  in  Ft.  Collins.  His  research  interests  include  parallel  algorithms,  distributed 
operating  systems,  software  reliability,  fault-tolerant  computing,  networks  and  graph  theory  applications. 
He  is  a  senior  member  of  the  IEEE,  and  member  of  ACM,  IEEE  CS  Press  Editorial  Board  and  the 
Editorial  Board  of  the  International  Journal  of  Computer  Simulation. 


words  capitalized.  Underline  the  heading  and  start  the  text  on  the  next  line.  For  third-level  headings,  only 
the  first  letter  should  be  a  capital.  Underline,  then  begin  text  after  three  spaces. 

FIRST-LEVEL  TEXT  HEADINGS 

Second  —  Level  Text  Headings 

Third  -  level  headings  With  text  run  on 


FIGURES  AND  CAPTIONS 

Please  do  not  intersperse  figures  (line  drawings,  photos,  computer  printouts)  throughout  the  text,  but 
group  them  together  at  the  end  of  the  manuscript.  Limit  figures  to  one  per  page. 

Number  all  figures  consecutively  with  arabic  numerals  on  the  back  of  the  camera-ready  drawings,  and 
include  the  author’s  name  (Smith,  Figure  1;  Smith,  Figure  2;  etc.)  and  paper  title  (abbreviated).  Please 
indicate  “top”  where  this  is  not  obvious.  Be  certain  to  refer  to  each  figure  in  the  text,  e.g.,  (See  Figure  2)  as 
this  is  essential  to  efficient  searching  of  articles  on  electronic  delivery.  In  addition,  please  indicate  where 
the  typesetter  should  position  artwork  by  marking  and  circling  the  first  call-out  to  each  figure  in  the 
manuscript  margin. 

Please  include  a  separate  sheet  containing  a  list  of  figure  captions  which  should  be  substantively  one  or 
two  sentences.  Captions  will  be  set  in  type  by  the  typesetter. 

The  original  figures  and  one  copy  should  be  sent  when  the  article  is  accepted. 

Preparation 

Figures  submitted  must  be  of  a  high  enough  standard  for  direct  reproduction,  with  clear,  sharp,  black 
lines  on  a  white  background.  Figures  should  be  drawn  on  smooth  white  paper  or  computer  generated  and 
reproduced  by  a  high-quality  laser  printer  or  plotter.  Letters  and  numbers  should  be  drawn  professionally 
with  a  LeRoy  Lettering  Set  and  with  clear  black  ink.  No  additional  redrawing  of  artwork  will  be  done. 
Note  that  the  smallest  correction  will  require  reshooting.  Therefore,  check  carefully  for  accuracy  to  avoid 
this  costly  and  time-consuming  procedure. 

Photographs:  Photographs  intended  for  halftone  reproduction  must  be  good  glossy  original  prints,  of 
maximum  contrast. 

Computer  Printouts:  A  computer  printout  of  more  than  several  lines  should  not  be  embedded  in  the 
manuscript,  but  treated  as  a  figure.  The  printout  should  be  labeled  consecutively  along  with  the  line 
drawings  (e.g.,  “Figure  4”),  be  referred  to  in  the  text,  and  be  placed  at  the  end  of  the  manuscript  with  the 
other  figures.  Because  the  printout  will  be  photographed  as  line  art,  it  should  be  a  clear,  sharp  printout 
produced  on  a  high-quality  laser  printer. 

Size:  Illustrations  should  be  prepared  to  fit  either  into  a  single  column  width  (75  mm)  or  double  column 
width  (160  mm)  with  capital  lettering  4  mm  high,  to  accommodate  reduction.  Photographs  for  halftone 
reproduction  should  be  about  twice  the  desired  size. 

TABLES 

Number  tables  consecutively  with  roman  numerals,  and  give  each  a  clear  descriptive  caption  at  the  top.  Be 
sure  to  refer  to  each  table  in  the  text;  for  example,  (See  Table  I),  and  indicate  in  the  margin  where  the 
typesetter  should  place  tables.  Limit  tables  to  one  per  page.  Avoid  the  use  of  vertical  rules  in  the  tables. 

MATHEMATICAL  EQUATIONS 

Wherever  possible,  mathematical  equations  should  be  entered  in  the  word  processing  program,  with 
unusual  or  ambiguous  symbols  identified  in  the  margin  where  they  first  occur.  Alternatively,  equations 
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2.  forward  the  original  permission  letter  to  the  Editor  in  Chief,  Dr.  George  W.  Zobrist, 
immediately  upon  receipt  and  retain  a  copy  for  your  own  records; 

3.  credit  the  source  (e.g.,  Reprinted  from...:  Adapted  from...). 

ANY  TRADEMARKS  THAT  ARE  USED  MUST  BE  LISTED  IN  THE  ACKNOWLEDGMENTS  SECTION. 
Paper  Organization 

Front  Sheet:  Title,  Author(s),  Affiliation(s),  Address(es),  Phone  No.(s),  Email  Address(es),  FAX 
No.(s). 
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REMEMBER  THE  ORIGINALS  OF  THE  FIGURES/TABLES/COMPUTER  PRINTOUTS  ARE 
NOT  TO  BE  PLACED  WITHIN  THE  MANUSCRIPT,  BUT  MUST  BE  PLACED  AT  THE  END  OF 
THE  PAPER,  ONE  TO  A  PAGE.  THERE  ALSO  MUST  BE  A  FIGURE/TABLE/COMPUTER 
CAPTION  LIST  IN  FRONT  OF  THE  ORIGINALS. 

Language:  English  only  please. 

Length:  The  text  of  a  full-length  manuscript  should  be  between  15  and  40  double-spaced  pages,  plus 
artwork  grouped  at  the  end.  The  journal  will  also  accept  short  technical  notes  of  10-15  manuscript  pages, 
including  artwork. 

Abstract:  On  the  first  page  of  the  manuscript,  following  the  title  and  author  byline  (name,  title, 
organization,  city,  state,  country)  please  include  a  100  to  150  word  abstract  summarizing  your  problem 
and  findings. 

The  abstract  should  be  as  informative  as  possible,  consisting  of  short,  direct,  and  complete  sentences. 
Length  must  be  less  than  150  words.  The  abstract  should  state  the  objectives  of  the  work,  summarize  the 
results,  and  give  the  principal  conclusions  and  recommendations.  It  should  state  clearly  whether  the  focus 
is  on  theoretical  developments  or  on  practical  questions,  and  whether  subject  matter  or  method  is 
emphasized.  The  title  need  not  be  repeated.  Work  planned  should  not  be  described  in  the  abstract. 
Because  abstracts  are  extracted  from  a  paper  and  used  separately,  one  should  not  use  the  first  person, 
display  mathematics,  nor  use  citation  reference  numbers.  Avoid  starting  with  the  words  “This  paper....” 

Keywords:  On  a  line  below  the  abstract  please  include  six  key  words  for  indexing  and  database  word 
searches. 

Text  Headings 

Set  first-level  headings  in  the  text  to  the  left,  typed  all  in  capitals  (upper  case);  begin  the  text  on  the 
following  line.  Second-level  headings  should  be  typed  in  small  (lower  case)  letters  but  with  all  the  main 


FOOTNOTES 

Authors  are  encouraged  to  minimize  the  use  of  footnotes.  A  footnote  may  include  the  designation  of  a 
corresponding  author  of  the  paper,  current  address  information  for  an  author  (if  different  from  that 
shown  in  the  affiliation),  and  traditional  footnote  content.  Information  concerning  grant  support  of 
research  should  appear  in  a  separate  Acknowledgments  section  at  the  end  of  the  paper,  not  in  a  footnote. 
Acknowledgments  of  the  assistance  of  colleagues  or  similar  notes  of  appreciation  also  properly  belong  in 
an  Acknowledgments  section,  not  in  footnotes. 

Footnotes  should  be  indicated  in  the  text  by  the  following  symbols:  *  (asterisk  or  star),  f  (dagger), 
J  (double  dagger),  f  (paragraph  mark),  §  (section  mark),  ||  (parallels),  #  (number  sign).  Do  not  use 
numerals  for  footnote  call-outs,  as  they  may  be  mistaken  for  bibliographical  reference  call-outs  or 
exponents.  Type  each  footnote  at  the  bottom  of  the  typescript  page  on  which  its  text  call-out  appears. 

Footnotes  within  a  table  should  be  indicated  by  the  same  symbols  listed  above.  Reinitialize  symbol 
sequence  within  tables. 

SUBMISSION  OF  ARTICLES  ON  DISKETTE 

The  publisher  encourages  authors  to  submit  accepted  manuscripts  on  computer  diskette.  Authors  must 
forward  to  the  Editor  in  Chief  a  hard  copy  of  the  manuscript,  in  triplicate,  along  with  the  diskettes.  The 
diskette  files  and  hard  copy  must  match  exactly.  Please  refer  to  the  back  of  the  journal  for  instructions  on 
submission  of  articles  on  diskette.  These  are  also  available  from  the  journal  Editor  in  Chief  or  the 
publisher. 

SUBMISSION  OF  ANIMATION 

Author-supplied  animation  related  to  articles  accepted  for  publication  will  be  included  in  the  journal  CD- 
ROM  and  published  on  the  World  Wide  Web  at  no  cost  to  authors.  Animations  are  limited  to  a  time 
duration  of  30  seconds.  Animation  should  be  submitted  to  the  journal  Editor  in  Chief  with  the  final 
manuscript,  after  it  has  completed  the  refereeing  process. 

Animations  in  the  following  forms  (in  order  of  preference)  can  be  accepted  from  authors: 

•  Video  tape 

•  AVI  or  QuickTime  files 

•  A  sequence  of  still  images 

The  following  formats  can  be  accepted: 

•  all  uncompressed  formats  widely  used  on  PC,  Mac  and  UNIX 

•  JPEG  for  colored  and  compressed  images 

•  TIFF  with  a  group  IV  compression  for  black  and  white  compressed  images 

•  FLI  and  FLC  format  from  AutoDesk 

Authors  who  submit  animations  are  requested  to  provide  the  following  information: 

•  Video  tape  -  format  used 

•  AVI  or  QuickTime  files  -  version  and  system  used  for  disk  file  creation 

•  Sequence  of  still  images  -  format  used,  version  and  system  used  for  disk  file  creation 

Authors  who  are  unable  to  supply  video  tape,  AVI  or  QuickTime  files  may  provide  the  publisher  with  a 
set  of  sequential  still  images.  Note  than  an  animated  sequence  will  consist  of  13  to  15  still  images  per 
second  of  animation;  e.g.,  if  an  animated  sequence  is  10  seconds  in  duration,  it  is  made  up  of  130  images. 
Authors  who  are  unable  to  submit  in  any  of  the  above-mentioned  formats  are  advised  to  contact  the 
publisher  to  discuss  other  options  prior  to  submission. 


Animation  should  be  mentioned  in  the  text.  Indicate  an  approximate  location  for  the  animation  call-out 
in  the  margin. 

PROOFS 

The  first-named  author  will  be  airmailed  proofs  (including  figures)  for  correction.  These  must  be  air¬ 
mailed  back  to  the  typesetter  within  48  hours  of  receipt.  Please  include  a  full  postal  address  on  the  first 
page  of  the  manuscript  so  proofs  are  not  delayed.  Author’s  alterations  in  excess  of  10%  of  the  original 
composition  cost  will  be  charged  to  the  author. 

REPRINTS 

Twenty-five  complimentary  reprints  will  be  sent  to  the  first-named  author  of  each  paper.  Additional 
reprints  may  be  ordered  by  completing  the  appropriate  form  sent  with  proofs. 

PAGE  CHARGES 

There  are  no  page  charges  to  individuals  or  to  institutions.  See  the  journal  inside  back  cover  for 
information  on  the  publisher’s  negative  page  charge  voucher  and  voluntary  page  charge  programs. 

A  FINAL  NOTE 

Typescripts  that  do  not  conform  to  the  required  standard  of  preparation  for  submission  outlined  here  will 
be  returned  to  the  authors  for  correction  before  review. 


INSTRUCTIONS  for  AUTHORS 


Article  Submission  on  Disk 

The  Publisher  welcomes  submissions  on  disk.  The  instructions  that  follow  are  intended  for  use  by 
authors  whose  articles  have  been  accepted  for  publication  and  are  in  final  form.  Your  adherence  to  these 
guidelines  will  facilitate  the  processing  of  your  disk  by  the  typesetter.  These  instructions  do  not  replace 
the  journal  Notes  for  Contributors;  all  information  in  Notes  for  Contributors  remains  in  effect. 


When  typing  your  article,  do  not  include  de¬ 
sign  or  formatting  information.  Type  all  text  flush 
left,  unjustified  and  without  hyphenation.  Do  not 
use  indents,  tabs  or  multi-spacing.  If  an  indent  is 
required,  please  note  it  by  a  line  space;  also  mark 
the  position  of  the  indent  on  the  hard  copy  manu¬ 
script.  Indicate  the  beginning  of  a  new  paragraph 
by  typing  a  line  space.  Leave  one  space  at  the  end 
of  a  sentence,  after  a  comma  or  other  punctuation 
mark,  and  before  an  opening  parenthesis.  Be  sure 
not  to  confuse  lower  case  letter  “1”  with  numeral 
‘T\  or  capital  letter  “O”  with  numeral  “0”.  Dis¬ 
tinguish  opening  quotes  from  close  quotes.  Do  not 
use  automatic  page  numbering  or  running  heads. 

Tables  and  displayed  equations  may  have  to  be 
rekeyed  by  the  typesetter  from  your  hard  copy 
manuscript.  Refer  to  the  journal  Notes  for  Con¬ 
tributors  for  style  for  Greek  characters,  variables, 
vectors,  etc. 

Articles  prepared  on  most  word  processors  are 
acceptable.  If  you  have  imported  equations  and/ 
or  scientific  symbols  into  your  article  from  an¬ 
other  program,  please  provide  details  of  the  pro¬ 


gram  used  and  the  procedures  you  followed.  If 
you  have  used  macros  that  you  have  created, 
please  include  them  as  well. 

You  may  supply  illustrations  that  are  available 
in  an  electronic  format  on  a  separate  disk.  Please 
clearly  indicate  on  the  disk  the  file  format  and/or 
program  used  to  produce  them,  and  supply  a  high- 
quality  hard  copy  of  each  illustration  as  well. 
Submit  your  disk  when  you  submit  your  final  hard 
copy  manuscript.  The  disk  file  and  hard  copy  must 
match  exactly. 

If  you  are  submitting  more  than  one  disk,  please 
number  each  disk.  Please  mark  each  disk  with  the 
journal  title,  author  name,  abbreviated  article  title 
and  file  names. 

Be  sure  to  retain  a  back-up  copy  of  each  disk 
submitted.  Pack  your  disk  carefully  to  avoid  dam¬ 
age  in  shipping,  and  submit  it  with  your  hard  copy 
manuscript  and  complete  Disk  Specifications  form 
(see  reverse)  to  the  person  designated  in  the  jour¬ 
nal  Notes  for  Contributors. 
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